We regret to announce that the workshop has been cancelled.
Everyday, thousands of videos are uploaded into the web creating an ever-growing demand for methods to make them easier to retrieve,search, and index. While visual information is a very important part of a video, acoustic information often complements it. This is especially true for the analysis of consumer-produced, unconstrained videos from social media networks, such as YouTube uploads or Flickr content.
The diversity in content, recording equipment, environment, quality, etc. poses significant challenges to the current state of the art in multimedia and audio analytics. The fact that this data is from non-professional and consumer sources means that it often has little or no manual labeling. Large-scale multi-modal analysis of audio-visual material can help overcome this problem, and provide training and testing material across modalities for language understanding, human action recognition, and scene identification algorithms, with applications in robotics, interactive agents, etc. Speech and audio provide a natural modality to summarize and interactwith the content of videos. Therefore, speech and audio processing is critical for multimedia analysis that goes beyond traditional classification and retrieval applications.
The goal of the 2nd ACM International Workshop on Audio and Multimedia Methods for Large-Scale Video Analysis (AMVA) is to bring together researchers and practitioners in this newly emerging field, and to foster discussion on future directions of the topic by providing a forum for focused exchanges on new ideas, developments, and results. The aim is to build a strong community and a venue tha at some point can become its own conference.