This book presents a thorough overview of fusion in computer vision, from an interdisciplinary and multi-application viewpoint, describing successful approaches, evaluated in the context of international benchmarks that model realistic use cases. Features: examines late fusion approaches for concept recognition in images and videos; describes the interpretation of visual content by incorporating models of the human visual system with content understanding methods; investigates the fusion of multi-modal features of different semantic levels, as well as results of semantic concept detections, for example-based event recognition in video; proposes rotation-based ensemble classifiers for high-dimensional data, which encourage both individual accuracy and diversity within the ensemble; reviews application-focused strategies of fusion in video surveillance, biomedical information retrieval, and content detection in movies; discusses the modeling of mechanisms of human interpretation of complex visual content.
Les mer
This book presents a thorough overview of fusion in computer vision, from an interdisciplinary and multi-application viewpoint, describing successful approaches, evaluated in the context of international benchmarks that model realistic use cases.
Les mer
A Selective Weighted Late Fusion for Visual Concept Recognition.- Bag-of-Words Image Representation: Key Ideas and Further Insight.- Hierarchical Late Fusion for Concept Detection in Videos.- Fusion of Multiple Visual Cues for Object Recognition in Video.- Evaluating Multimedia Features and Fusion for Example-Based Event Detection.- Rotation-Based Ensemble Classifiers for High Dimensional Data.- Multimodal Fusion in Surveillance Applications.- Multimodal Violence Detection in Hollywood Movies: State-of-the-Art and Benchmarking.- Fusion Techniques in Biomedical Information Retrieval.- Using Crowdsourcing to Capture Complexity in Human Interpretations of Multimedia Content.
Les mer
Visual content understanding is a complex and important challenge for applications in automatic multimedia information indexing, medicine, robotics, and surveillance. Yet the performance of such systems can be improved by the fusion of individual modalities/techniques for content representation and machine learning.This comprehensive text/reference presents a thorough overview of Fusion in Computer Vision, from an interdisciplinary and multi-application viewpoint. Presenting contributions from an international selection of experts, the work describes numerous successful approaches, evaluated in the context of international benchmarks that model realistic use cases at significant scales.Topics and features:Examines late fusion approaches for concept recognition in images and videos, including the bag-of-words modelDescribes the interpretation of visual content by incorporating models of the human visual system with content understanding methodsInvestigates the fusion of multi-modal features of different semantic levels, as well as results of semantic concept detections, for example-based event recognition in videoProposes rotation-based ensemble classifiers for high-dimensional data, which encourage both individual accuracy and diversitywithin the ensembleReviews application-focused strategies of fusion in video surveillance, biomedical information retrieval, and content detection in moviesDiscusses the modeling of mechanisms of human interpretation of complex visual contentThis authoritative collection is essential reading for researchers and students interested in the domain of information fusion for complex visual content understanding, and related fields.
Les mer
Examines information fusion in the context of multimodal and multidimensional data representation, i.e., video, image and text Presents a focus on information fusion for tackling higher-level description of multimedia information Discusses the latest research on a broad range of multimedia information fusion techniques Includes supplementary material: sn.pub/extras
Les mer

Produktdetaljer

ISBN
9783319056951
Publisert
2014-04-10
Utgiver
Vendor
Springer International Publishing AG
Høyde
235 mm
Bredde
155 mm
Aldersnivå
Research, P, 06
Språk
Product language
Engelsk
Format
Product format
Innbundet

Biographical note

Dr. Bogdan Ionescu is a lecturer and Coordinator of the Video Processing Group at the Image Processing and Analysis Laboratory, University Politehnica of Bucharest, Romania. Dr. Jenny Benois-Pineau is a full professor and Chair of the Video Analysis and Indexing research group at the University of Bordeaux, France. Dr. Tomas Piatrik is a senior researcher in the Multimedia and Vision Research Group at Queen Mary University of London, UK. Dr. Georges Quénot is a senior researcher at CNRS and leader of the Multimedia Information Modeling and Retrieval group at the Grenoble Informatics Laboratory, France.