default search action
22nd ACM Multimedia 2014: Orlando, FL, USA
- Kien A. Hua, Yong Rui, Ralf Steinmetz, Alan Hanjalic, Apostol Natsev, Wenwu Zhu:
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03 - 07, 2014. ACM 2014, ISBN 978-1-4503-3063-3
Keynote 1
- Harry Shum:
Bing, the fastest growing image search engine. 1
Keynote 2
- Rosalind W. Picard:
Affective media and wearables: surprising findings. 3-4
Keynote 3
- Klara Nahrstedt:
Back and to the future: quality provisioning for multimedia content delivery. 5
Best Paper Session
- Fangxiang Feng, Xiaojie Wang, Ruifan Li:
Cross-modal Retrieval with Correspondence Autoencoder. 7-16 - AmirHossein Habibian, Thomas Mensink, Cees G. M. Snoek:
VideoStory: A New Multimedia Embedding for Few-Example Recognition and Translation of Events. 17-26 - Yelin Kim, Emily Mower Provost:
Say Cheese vs. Smile: Reducing Speech-Related Variability for Facial Emotion Recognition. 27-36
Multimedia Art and Entertainment
- Javier Villegas, Angus Graeme Forbes:
Analysis/synthesis approaches for creatively processing video signals. 37-46 - Sicheng Zhao, Yue Gao, Xiaolei Jiang, Hongxun Yao, Tat-Seng Chua, Xiaoshuai Sun:
Exploring Principles-of-Art Features For Image Emotion Recognition. 47-56 - Jiajia Li, Grace Ngai, Stephen Chi-fai Chan, Kien A. Hua, Hong Va Leong, Alvin T. S. Chan:
From Writing to Painting: A Kinect-Based Cross-Modal Chinese Painting Generation System. 57-66 - Charles Roberts, Matthew Wright, JoAnn Kuchera-Morin, Tobias Höllerer:
Gibber: Abstractions for Creative Multimedia Programming. 67-76
Action, Activity, and Event Recognition
- Zhigang Ma, Yi Yang, Nicu Sebe, Alexander G. Hauptmann:
Multiple Features But Few Labels?: A Symbiotic Solution Exemplified for Video Analysis. 77-86 - Chengcheng Jia, Yu Kong, Zhengming Ding, Yun Raymond Fu:
Latent Tensor Transfer Learning for RGB-D Action Recognition. 87-96 - Keze Wang, Xiaolong Wang, Liang Lin, Meng Wang, Wangmeng Zuo:
3D Human Activity Recognition with Reconfigurable Convolutional Neural Networks. 97-106 - Pei Xu, Mao Ye, Xue Li, Qihe Liu, Yi Yang, Jian Ding:
Dynamic Background Learning through Deep Auto-encoder Networks. 107-116
Music, Speech and Audio
- Bin Wu, Erheng Zhong, Andrew Horner, Qiang Yang:
Music Emotion Recognition by Multi-label Multi-layer Multi-instance Multi-view Learning. 117-126 - Kuang Mao, Ju Fan, Lidan Shou, Gang Chen, Mohan S. Kankanhalli:
Song Recommendation for Social Singing Community. 127-136 - Hervé Bredin, Anindya Roy, Nicolas Pécheux, Alexandre Allauzen:
"Sheldon speaking, Bonjour!": Leveraging Multilingual Tracks for (Weakly) Supervised Speaker Identification. 137-146 - Kai Li, Jun Ye, Kien A. Hua:
What's Making that Sound? 147-156
Deep Learning for Multimedia
- Ji Wan, Dayong Wang, Steven Chu-Hong Hoi, Pengcheng Wu, Jianke Zhu, Yongdong Zhang, Jintao Li:
Deep Learning for Content-Based Image Retrieval: A Comprehensive Study. 157-166 - Zuxuan Wu, Yu-Gang Jiang, Jun Wang, Jian Pu, Xiangyang Xue:
Exploring Inter-feature and Inter-class Relationships with Deep Neural Networks for Video Classification. 167-176 - Tianjun Xiao, Jiaxing Zhang, Kuiyuan Yang, Yuxin Peng, Zheng Zhang:
Error-Driven Incremental Learning in Deep Convolutional Neural Network for Large-Scale Image Classification. 177-186 - Hanwang Zhang, Yang Yang, Huan-Bo Luan, Shuicheng Yan, Tat-Seng Chua:
Start from Scratch: Towards Automatically Identifying, Modeling, and Naming Visual Attributes. 187-196
Multimedia Grand Challenge
- Shintami Chusnul Hidayati, Kai-Lung Hua, Wen-Huang Cheng, Shih-Wei Sun:
What are the Fashion Trends in New York? 197-200 - Yin-Hsi Kuo, Yan-Ying Chen, Bor-Chun Chen, Wen-Yu Lee, Chun-Che Wu, Chia-Hung Lin, Yu-Lin Hou, Wen-Feng Cheng, Yi-Chih Tsai, Chung-Yen Hung, Liang-Chi Hsieh, Winston H. Hsu:
Discovering the City by Mining Diverse and Multimodal Data Streams. 201-204 - Jan Zahálka, Stevan Rudinac, Marcel Worring:
New Yorker Melange: Interactive Brew of Personalized Venue Recommendations. 205-208 - Rajiv Ratn Shah, Yi Yu, Anwar Dilawar Shaikh, Suhua Tang, Roger Zimmermann:
ATLAS: Automatic Temporal Segmentation and Annotation of Lecture Videos Based on Modelling Transition Time. 209-212 - Brendan Jou, Subhabrata Bhattacharya, Shih-Fu Chang:
Predicting Viewer Perceived Emotions in Animated GIFs. 213-216 - Yogesh Singh Rawat, Mohan S. Kankanhalli:
Context-Based Photography Learning using Crowdsourced Images and Social Media. 217-220 - Mei-Chen Yeh, Hsiao-Wei Lin:
Virtual Portraitist: Aesthetic Evaluation of Selfies Based on Angle. 221-224 - Jian Wang, Cuicui Kang, Yonghao He, Shiming Xiang, Chunhong Pan:
Cross Modal Deep Model and Gaussian Process Based Model for MSR-Bing Challenge. 225-228 - Yalong Bai, Wei Yu, Tianjun Xiao, Chang Xu, Kuiyuan Yang, Wei-Ying Ma, Tiejun Zhao:
Bag-of-Words Based Deep Neural Network for Image Retrieval. 229-232 - Yingwei Pan, Ting Yao, Xinmei Tian, Houqiang Li, Chong-Wah Ngo:
Click-through-based Subspace Learning for Image Search. 233-236
Multimedia HCI and QoE
- Luming Zhang, Yue Gao, Chao Zhang, Hanwang Zhang, Qi Tian, Roger Zimmermann:
Perception-Guided Multimodal Feature Fusion for Photo Aesthetics Assessment. 237-246 - Hiromi Nemoto, Philippe Hanhart, Pavel Korshunov, Touradj Ebrahimi:
Impact of Ultra High Definition on Visual Attention. 247-256 - Jiangyang Zhang, C.-C. Jay Kuo:
An Objective Quality of Experience (QoE) Assessment Index for Retargeted Images. 257-266 - Wei Song, Dian Tjondronegoro, Ivan Himawan:
Acceptability-based QoE Management for User-centric Mobile Video Delivery: A Field Study Evaluation. 267-276
Multimedia Analysis and Mining
- Wenxuan Xie, Yuxin Peng, Jianguo Xiao:
Weakly-Supervised Image Parsing via Constructing Semantic Graphs and Hypergraphs. 277-286 - Xiaopeng Zhang, Hongkai Xiong, Wengang Zhou, Qi Tian:
Fused one-vs-all mid-level features for fine-grained visual categorization. 287-296 - Wei Zhang, Hongzhi Li, Chong-Wah Ngo, Shih-Fu Chang:
Scalable Visual Instance Mining with Threads of Features. 297-306 - Yanfei Wang, Fei Wu, Jun Song, Xi Li, Yueting Zhuang:
Multi-modal Mutual Topic Reinforce Modeling for Cross-media Retrieval. 307-316
Multimedia Systems
- Vengatanathan Krishnamoorthi, Niklas Carlsson, Derek L. Eager, Anirban Mahanti, Nahid Shahmehri:
Quality-adaptive Prefetching for Interactive Branched Video using HTTP-based Adaptive Streaming. 317-326 - Benjamin Rainer, Christian Timmerer:
Self-Organized Inter-Destination Multimedia Synchronization For Adaptive Media Streaming. 327-336 - Kiana Calagari, Krzysztof Templin, Tarek Elgamal, Khaled M. Diab, Piotr Didyk, Wojciech Matusik, Mohamed Hefeeda:
Anahita: A System for 3D Video Streaming with Depth Customization. 337-346 - Li Lin, Xiaofei Liao, Guang Tan, Hai Jin, Xiaobin Yang, Wei Zhang, Bo Li:
LiveRender: A Cloud Gaming System Based on Compressed Graphics Streaming. 347-356
Emotional and Social Signals in Multimedia
- Enver Sangineto, Gloria Zen, Elisa Ricci, Nicu Sebe:
We are not All Equal: Personalizing Models for Facial Expression Analysis with Transductive Parameter Transfer. 357-366 - Tao Chen, Felix X. Yu, Jiawei Chen, Yin Cui, Yan-Ying Chen, Shih-Fu Chang:
Object-Based Visual Sentiment Concept Analysis and Application. 367-376 - Florian Lingenfelser, Johannes Wagner, Elisabeth André, Gary McKeown, William Curran:
An Event Driven Fusion Approach for Enjoyment Recognition in Real-time. 377-386 - John R. Zhang, Jason Sherwin, Jacek Dmochowski, Paul Sajda, John R. Kender:
Correlating Speaker Gestures in Political Debates with Audience Engagement Measured via EEG. 387-396
High Risks High Rewards
- Michael Riegler, Martha A. Larson, Mathias Lux, Christoph Kofler:
How 'How' Reflects What's What: Content-based Exploitation of How Users Frame Social Images. 397-406 - Miaojing Shi, Teddy Furon, Hervé Jégou:
A Group Testing Framework for Similarity Search in High-dimensional Spaces. 407-416 - Eva Mohedano, Graham Healy, Kevin McGuinness, Xavier Giró-i-Nieto, Noel E. O'Connor, Alan F. Smeaton:
Object Segmentation in Images using EEG Signals. 417-426 - Oche Ejembi, Saleem N. Bhatti:
Help Save The Planet: Please Do Adjust Your Picture. 427-436
Multimedia Applications
- Kenta Kusumoto, Teemu Kinnunen, Jari Kätsyri, Heikki Lindroos, Pirkko Oittinen:
Media Experience of Complementary Information and Tweets on a Second Screen. 437-446 - Pradeep Kumar Jayaraman, Chi-Wing Fu:
Interactive Line Drawing Recognition and Vectorization with Commodity Camera. 447-456 - Xin Lu, Zhe Lin, Hailin Jin, Jianchao Yang, James Z. Wang:
RAPID: Rating Pictorial Aesthetics using Deep Learning. 457-466 - Si Liu, Xiaodan Liang, Luoqi Liu, Ke Lu, Liang Lin, Shuicheng Yan:
Fashion Parsing with Video Context. 467-476
Privacy, Health and Well-being
- Andrey Bogomolov, Bruno Lepri, Michela Ferron, Fabio Pianesi, Alex Pentland:
Daily Stress Recognition from Mobile Phone Data, Weather Conditions and Individual Traits. 477-486 - Shenggao Zhu, Robert J. Ellis, Gottfried Schlaug, Yee Sien Ng, Ye Wang:
Validating an iOS-based Rhythmic Auditory Cueing Evaluation (iRACE) for Parkinson's Disease. 487-496 - Zhan Qin, Jingbo Yan, Kui Ren, Chang Wen Chen, Cong Wang:
Towards Efficient Privacy-preserving Image Feature Extraction in Cloud Computing. 497-506 - Huijie Lin, Jia Jia, Quan Guo, Yuanyuan Xue, Qi Li, Jie Huang, Lianhong Cai, Ling Feng:
User-level psychological stress detection from social media using deep neural network. 507-516
Multimedia Search and Indexing
- Jianfeng Wang, Heng Tao Shen, Shuicheng Yan, Nenghai Yu, Shipeng Li, Jingdong Wang:
Optimized Distances for Binary Code Ranking. 517-526 - Yao Hu, Zhongming Jin, Hongyi Ren, Deng Cai, Xiaofei He:
Iterative Multi-View Hashing for Cross Media Indexing. 527-536 - Xiaopeng Yang, Tao Mei, Yongdong Zhang:
Rescue Tail Queries: Learning to Image Search Re-rank via Click-wise Multimodal Fusion. 537-546 - Lu Jiang, Deyu Meng, Teruko Mitamura, Alexander G. Hauptmann:
Easy Samples First: Self-paced Reranking for Zero-Example Multimedia Search. 547-556
Social Media and Crowd
- Ming Yan, Jitao Sang, Changsheng Xu:
Mining Cross-network Association for YouTube Video Promotion. 557-566 - Xue Geng, Hanwang Zhang, Zheng Song, Yang Yang, Huan-Bo Luan, Tat-Seng Chua:
One of a Kind: User Profiling by Social Curation. 567-576 - Axel Carlier, Lilian Calvet, Duong-Trung-Dung Nguyen, Wei Tsang Ooi, Pierre Gurdjos, Vincent Charvillat:
3D Interest Maps From Simultaneous Video Recordings. 577-586 - Prem Seetharaman, Bryan Pardo:
Crowdsourcing a Reverberation Descriptor Map. 587-596
Multimedia Recommendations
- Peng Cui, Zhiyu Wang, Zhou Su:
What Videos Are Similar with You?: Learning a Common Attributed Representation for Video Recommendation. 597-606 - Rajiv Ratn Shah, Yi Yu, Roger Zimmermann:
ADVISOR: Personalized Video Soundtrack Recommendation by Late Fusion with Heuristic Rankings. 607-616 - Shaowei Liu, Peng Cui, Wenwu Zhu, Shiqiang Yang, Qi Tian:
Social Embedding Image Distance Learning. 617-626 - Xinxi Wang, Ye Wang:
Improving Content-based and Hybrid Music Recommendation using Deep Learning. 627-636
Doctoral Symposium 1
- Mario Taschwer:
Medical case retrieval. 639-642 - Stefan Wilk, Wolfgang Effelsberg:
Mobile Video Broadcasting Services: Combining Video Composition and Network Efficient Transmission. 643-646 - David Grunberg:
Music-information retrieval in environments containing acoustic noise. 647-650 - Jeffrey J. Scott:
Automated Multi-Track Mixing and Analysis of Instrument Mixtures. 651-654
Doctoral Symposium 2
- Jichao Sun:
Local Selection of Features for Image Search and Annotation. 655-658 - Manfred Jürgen Primus:
Segmentation and Indexing of Endoscopic Videos. 659-662 - Desara Xhura:
Learning recognition of semantically relevant video segments from endoscopy videos contributed and edited in a private social network. 663-666 - Mario Guggenberger:
Multimodal Alignment of Videos. 667-670
Open Source Software Competition 1
- Xin Yang, Chong Huang, Kwang-Ting (Tim) Cheng:
libLDB: a library for extracting ultrafast and distinctive binary feature description. 671-674 - Yangqing Jia, Evan Shelhamer, Jeff Donahue, Sergey Karayev, Jonathan Long, Ross B. Girshick, Sergio Guadarrama, Trevor Darrell:
Caffe: Convolutional Architecture for Fast Feature Embedding. 675-678 - Joan Alabort-i-Medina, Epameinondas Antonakos, James Booth, Patrick Snape, Stefanos Zafeiriou:
Menpo: A Comprehensive Platform for Parametric Image Alignment and Visual Deformable Models. 679-682
Open Source Software Competition 2
- Jack Jansen:
VideoLat: An Extensible Tool for Multimedia Delay Measurements. 683-686 - Matthijs Douze, Hervé Jégou:
The Yael Library. 687-690 - Giuseppe Becchi, Marco Bertini, Lorenzo Cioni, Alberto Del Bimbo, Andrea Ferracani, Daniele Pezzatini, Mathias Lux:
Loki+Lire: a framework to create web-based multimedia search engines. 691-694
Art Exhibit
- Parag Kumar Mital:
Audiovisual Resynthesis in an Augmented Reality. 695-698 - Charles Roberts:
Sound-Light Giblet. 699-700 - Michael Riegler, Mathias Lux, Christian Zellot, Lukas Knoch, Horst Schnattler, Sabrina Napetschnig, Julian Kogler, Claus Degendorfer, Norbert Spot, Manuel Zoderer:
Gone: an interactive experience for two people. 701-704 - Sarah Linebaugh:
Circles and Sounds. 705-708 - Yuan-Yi Fan:
Qi Visualizer: An Interactive Pulse Spectrogram Visualization using Mobile Participatory Biometrics. 709-712 - F. Myles Sciotto, Jean-Michel Crettaz:
Stoicheia: Architecture, Sound and Tesla's Apotheosis. 713-716 - Lonce Wyse:
States of Diffusion for n+1 devices. 717-719
Demos 1: Searching and Finding
- Julien Champ, Alexis Joly, Pierre Bonnet:
Fine-grained Visual Faceted Search. 721-722 - André F. Araújo, David M. Chen, Peter Vajda, Bernd Girod:
Real-time query-by-image video search system. 723-724 - Vamsidhar Reddy Gaddam, Ragnar Langseth, Håkon Kvale Stensland, Carsten Griwodz, Pål Halvorsen, Øystein Landsverk:
Automatic Real-Time Zooming and Panning on Salient Objects from a Panoramic Video. 725-726 - Hao-Kai Wen, Wei-Che Chang, Chia-Hu Chang, Yin-Tzu Lin, Ja-Ling Wu:
Event Detection in Broadcasting Video for Halfpipe Sports. 727-728 - Jianquan Liu, Shoji Nishimura, Takuya Araki:
Wally: A Scalable Distributed Automated Video Surveillance System with Rich Search Functionalities. 729-730 - Junshi Huang, Wei Xia, Shuicheng Yan:
Deep Search with Attribute-aware Deep Network. 731-732 - Rene Kaiser, Wolfgang Weiss, Manolis Falelakis, Marian Florin Ursu:
Virtual Director Adapting Visual Presentation to Conversation Context in Group Videoconferencing: An Interactive Demo. 733-734 - Jie Wu, Changhu Wang, Liqing Zhang, Yong Rui:
SmartVisio: Interactive Sketch Recognition with Natural Correction and Editing. 735-736
Demos 2: Senses and Sensors
- Nimesha Ranasinghe, Kuan-Yi Lee, Gajan Suthokumar, Ellen Yi-Luen Do:
Taste+: Digitally Enhancing Taste Sensations of Food and Beverages. 737-738 - Prem Seetharaman, Bryan Pardo:
Reverbalize: A Crowdsourced Reverberation Controller. 739-740 - Mark Cartwright, Bryan Pardo:
SynthAssist: an audio synthesizer programmed with vocal imitation. 741-742 - Yong-Xiang Wang, Li-Yun Lo, Min-Chun Hu:
Eat as much as you can: a kinect-based facial rehabilitation game based on mouth and tongue movements. 743-744 - Ahmad M. Qamar, Imad Afyouni, Delwar Hossain, Faizan Ur Rehman, Asad H. Toonsi, Mohamed Abdur Rahman, Saleh M. Basalamah:
A Multimedia E-Health Framework Towards An Interactive And Non-Invasive Therapy Monitoring Environment. 745-746 - Hongyun Cai, Zhongxian Tang, Yang Yang, Zi Huang:
EventEye: Monitoring Evolving Events from Tweet Streams. 747-748 - Yuan Tian, Suraj Raghuraman, Yin Yang, Xiaohu Guo, Balakrishnan Prabhakaran:
3D Immersive Cardiopulmonary Resuscitation (CPR) Trainer. 749-750 - Mei-Chen Yeh, Hsiao-Wei Lin:
Taking good selfies on your phone. 751-752
Demos 3: Systems
- Peng Wang, Yang Yang, Zi Huang, Jiewei Cao, Heng Tao Shen:
WeMash: An Online System for Web Video Mashup. 753-754 - Zhenhuan Gao, Chien-Nan (Shannon) Chen, Klara Nahrstedt:
FreeViewer: An Intelligent Director for 3D Tele-Immersion System. 755-756 - Jun Chen, Chaokun Wang, Lei Yang, Qingfu Wen, Xu Wang:
MiSCon: a hot plugging tool for real-time motion-based system control. 757-758 - Zhineng Chen, Jinfeng Bai, Chong-Wah Ngo, Bailan Feng, Bo Xu:
CeleLabel: an interactive system for annotating celebrities in web videos. 759-760 - Yoshiyuki Kawano, Keiji Yanai:
FoodCam-256: A Large-scale Real-time Mobile Food RecognitionSystem employing High-Dimensional Features and Compression of Classifier Weights. 761-762 - Daisuke Ochi, Yutaka Kunita, Kensaku Fujii, Akira Kojima, Shinnosuke Iwaki, Junichi Hirose:
HMD Viewing Spherical Video Streaming System. 763-764 - Duong-Trung-Dung Nguyen, Axel Carlier, Wei Tsang Ooi, Vincent Charvillat:
Jiku director 2.0: a mobile video mashup system with zoom and pan using motion maps. 765-766 - Mario Guggenberger, Mathias Lux, László Böszörményi:
ClockDrift: a mobile application for measuring drift in multimedia devices. 767-768
Posters 1
- Guangxin Ren, Junjie Cai, Shipeng Li, Nenghai Yu, Qi Tian:
Salable Image Search with Reliable Binary Code. 769-772 - Kota Yamaguchi, Tamara L. Berg, Luis E. Ortiz:
Chic or Social: Visual Popularity Analysis in Online Fashion Networks. 773-776 - Nakamasa Inoue, Koichi Shinoda:
n-gram Models for Video Semantic Indexing. 777-780 - Wei-Ta Chu, Ying-Chieh Chao:
Line-Based Drawing Style Description for Manga Classification. 781-784 - Fatih Çakir, Stan Sclaroff:
Supervised hashing with error correcting codes. 785-788 - Yubin Deng, Ping Luo, Chen Change Loy, Xiaoou Tang:
Pedestrian Attribute Recognition At Far Distance. 789-792 - Yin-Tzu Lin, Po-Nien Chen, Chia-Hu Chang, Ja-Ling Wu:
MSVA: Musical Street View Animator: An Effective and Efficient Way to Enjoy the Street Views of Your Journey. 793-796 - Parvez Ahammad, Brian Kennedy, Padmapani Ganti, Hariharan Kolam:
QoE-driven Unsupervised Image Categorization for Optimized Web Delivery: Short Paper. 797-800 - Zhengwei Huang, Ming Dong, Qirong Mao, Yongzhao Zhan:
Speech Emotion Recognition Using CNN. 801-804 - Shuyang Wang, Ming Shao, Yun Fu:
Attractive or Not?: Beauty Prediction with Attractiveness-Aware Encoders and Robust Late Fusion. 805-808 - Chin-Chia Michael Yeh, Ping-Keng Jao, Yi-Hsuan Yang:
AWtoolbox: Characterizing Audio Information Using Audio Words. 809-812 - Chih-Fan Hsu, De-Yu Chen, Chun-Ying Huang, Cheng-Hsin Hsu, Kuan-Ta Chen:
Screencast in the Wild: Performance and Limitations. 813-816 - Wei Jiang, Zhenyu Wu, John Wus, Hong Heather Yu:
One-Pass Video Stabilization on Mobile Devices. 817-820 - Bo Zhang, Yan Yan, Nicola Conci, Nicu Sebe:
You Talkin' to Me?: Recognizing Complex Human Interactions in Unconstrained Videos. 821-824 - Shoou-I Yu, Lu Jiang, Alexander G. Hauptmann:
Instructional Videos for Unsupervised Harvesting and Learning of Action Examples. 825-828 - Xiaobo Wang, Xiaochun Cao, Xiaojie Guo, Zhanjie Song:
Beautifying Fisheye Images using Orientation and Shape Cues. 829-832 - Jianfeng Xu, Shigeyuki Sakazawa:
Temporal Fusion Approach Using Segment Weight for Affect Recognition from Body Movements. 833-836 - Chun-Te Chu, Jaeyeon Jung, Zhicheng Liu, Ratul Mahajan:
sTrack: Secure Tracking in Community Surveillance. 837-840 - Na Zhao, Richang Hong, Meng Wang, Xuegang Hu, Tat-Seng Chua:
Searching for Recent Celebrity Images in Microblog Platform. 841-844 - Jiajun Wang, Yu-Gang Jiang, Qiang Wang, Kuiyuan Yang, Chong-Wah Ngo:
Organizing Video Search Results to Adapted Semantic Hierarchies for Topic-based Browsing. 845-848 - Xi Wang, Yu-Gang Jiang, Zhenhua Chai, Zichen Gu, Xinyu Du, Dong Wang:
Real-time summarization of user-generated videos based on semantic recognition. 849-852 - Yunhang Shen, Rongrong Ji, Donglin Cao, Min Wang:
Hacking Chinese Touclick CAPTCHA by Multi-Scale Corner Structure Model with Fast Pattern Matching. 853-856 - Kai Zhu, Dihong Gong, Zhifeng Li, Xiaoou Tang:
Orthogonal Gaussian Process for Automatic Age Estimation. 857-860 - Ying Zhang, Roger Zimmermann, Luming Zhang, David A. Shamma:
Points of Interest Detection from Multiple Sensor-Rich Videos in Geo-Space. 861-864 - Arindam Ghosh, Giuseppe Riccardi:
Recognizing Human Activities from Smartphone Sensor Signals. 865-868 - Honglin Yu, Lexing Xie, Scott Sanner:
Twitter-driven YouTube Views: Beyond Individual Influencers. 869-872 - Shijie Zhao, Xi Jiang, Junwei Han, Xintao Hu, Dajiang Zhu, Jinglei Lv, Tuo Zhang, Lei Guo, Tianming Liu:
Decoding Auditory Saliency from FMRI Brain Imaging. 873-876 - Huiyuan Fu, Huadong Ma, Hongtian Xiao:
Crowd Counting via Head Detection and Motion Flow Estimation. 877-880 - Prasanth Lade, Troy McDaniel, Sethuraman Panchanathan:
Semantic feature projection for continuous emotion analysis. 881-884 - Huiyuan Fu, Huadong Ma:
Real-time crowd detection based on gradient magnitude entropy model. 885-888 - Yang Mu, Henry Z. Lo, Wei Ding, Dacheng Tao:
Face Recognition from Multiple Images per Subject. 889-892 - Zhisheng Yan, Chang Wen Chen, Bin Liu:
Admission Control for Wireless Adaptive HTTP Streaming: An Evidence Theory Based Approach. 893-896 - Zheng Yang, Yao Hu, Haifeng Liu, Huajun Chen, Zhaohui Wu:
Matrix Completion for Cross-view Pairwise Constraint Propagation. 897-900 - Yueting Zhuang, Zhou Yu, Wei Wang, Fei Wu, Siliang Tang, Jian Shao:
Cross-Media Hashing with Neural Networks. 901-904 - Jie Nie, Peng Cui, Yan Yan, Lei Huang, Zhen Li, Zhiqiang Wei:
How Your Portrait Impresses People?: Inferring Personality Impressions from Portrait Contents. 905-908 - Bart Thomee, José G. Moreno, David A. Shamma:
Who's Time Is It Anyway?: Investigating the Accuracy of Camera Timestamps. 909-912 - Hanqi Wang, Fei Wu, Xi Li, Siliang Tang, Jian Shao, Yueting Zhuang:
Jointly Discovering Fine-grained and Coarse-grained Sentiments via Topic Modeling. 913-916 - Hao Kuang, Benjamin Guthier, Mukesh Kumar Saini, Dwarikanath Mahapatra, Abdulmotaleb El-Saddik:
A Real-Time Smart Assistant for Video Surveillance Through Handheld Devices. 917-920
Posters 2
- Jun Chen, Chaokun Wang, Jianmin Wang:
Modeling the Interest-Forgetting Curve for Music Recommendation. 921-924 - Bahetiyaer Bare, Ke Li, Weiyi Wang, Bo Yan:
Learning to Assess Image Retargeting. 925-928 - Bo Yan, Xiaochu Yang, Ke Li:
Efficient Image Retargeting via Adaptive Pixel Fusion. 929-932 - Haoqiang Fan, Mu Yang, Zhimin Cao, Yuning Jiang, Qi Yin:
Learning Compact Face Representation: Packing a Face into an int32. 933-936 - Yuanlu Xu, Bingpeng Ma, Rui Huang, Liang Lin:
Person Search in a Scene by Jointly Modeling People Commonness and Person Uniqueness. 937-940 - Tao Zhuo, Peng Zhang, Yanning Zhang, Wei Huang, Hichem Sahli:
Object Tracking using Reformative Transductive Learning with Sample Variational Correspondence. 941-944 - Di Wu, Ling Shao:
Multimodal Dynamic Networks for Gesture Recognition. 945-948 - Keisuke Doman, Taishi Tomita, Ichiro Ide, Daisuke Deguchi, Hiroshi Murase:
Event Detection based on Twitter Enthusiasm Degree for Generating a Sports Highlight Video. 949-952 - Hong Zhang, Junsong Yuan, Xingyu Gao, Zhenyu Chen:
Boosting cross-media retrieval via visual-auditory feature analysis and relevance feedback. 953-956 - Hui-Tang Chang, Yu-Chiang Frank Wang, Ming-Syan Chen:
Transfer in Photography Composition. 957-960 - Heysem Kaya, Albert Ali Salah:
Eyes Whisper Depression: A CCA based Multimodal Approach. 961-964 - Yehia Elkhatib, Rebecca Killick, Mu Mu, Nicholas J. P. Race:
Just Browsing?: Understanding User Journeys in Online TV. 965-968 - Xinyu Ou, Lingyu Yan, Hefei Ling, Cong Liu, Maolin Liu:
Inductive Transfer Deep Hashing for Image Retrieval. 969-972 - Jianguang Zhang, Yahong Han, Jinhui Tang, Qinghua Hu, Jianmin Jiang:
What Can We Learn about Motion Videos from Still Images? 973-976 - Felix X. Yu, Liangliang Cao, Michele Merler, Noel C. F. Codella, Tao Chen, John R. Smith, Shih-Fu Chang:
Modeling Attributes from Category-Attribute Proportions. 977-980 - Yang Wang, Xuemin Lin, Lin Wu, Wenjie Zhang, Qing Zhang:
Exploiting Correlation Consensus: Towards Subspace Clustering for Multi-modal Data. 981-984 - Xinyan Lu, Fei Wu, Xi Li, Yin Zhang, Weiming Lu, Donghui Wang, Yueting Zhuang:
Learning Multimodal Neural Network with Ranking Examples. 985-988 - Viet Anh Nguyen, Jiwen Lu, Minh N. Do:
Supervised Discriminative Hashing for Compact Binary Codes. 989-992 - Zhenxing Niu, Shiliang Zhang, Xinbo Gao, Qi Tian:
Personalized Visual Vocabulary Adaption for Social Image Retrieval. 993-996 - Xiaochun Cao, Yupeng Cheng, Zhiqiang Tao, Huazhu Fu:
Co-Saliency Detection via Base Reconstruction. 997-1000 - Hong-Wun Jheng, Bor-Chun Chen, Yan-Ying Chen, Winston H. Hsu:
Automatic Facial Image Annotation and Retrieval by Integrating Voice Label and Visual Appearance. 1001-1004 - Tianxu Ji, Xianglong Liu, Cheng Deng, Lei Huang, Bo Lang:
Query-Adaptive Hash Code Ranking for Fast Nearest Neighbor Search. 1005-1008 - Stefan Wilk, Wolfgang Effelsberg:
Systematic Assessment of the Video Recording Position for User-generated Event Videos. 1009-1012 - Hanhui Li, Donghui Li, Xiaonan Luo:
BAP: Bimodal Attribute Prediction for Zero-Shot Image Categorization. 1013-1016 - Michael Xuelin Huang, Tiffany C. K. Kwok, Grace Ngai, Hong Va Leong, Stephen C. F. Chan:
Building a Self-Learning Eye Gaze Model from User Interaction Data. 1017-1020 - Alexandru-Lucian Gînsca, Adrian Popescu, Bogdan Ionescu, Anil Armagan, Ioannis Kanellos:
Toward an Estimation of User Tagging Credibility for Social Image Retrieval. 1021-1024 - Sicheng Zhao, Hongxun Yao, You Yang, Yanhao Zhang:
Affective Image Retrieval via Multi-Graph Learning. 1025-1028 - Valentin Leveau, Alexis Joly, Olivier Buisson, Pierre Letessier, Patrick Valduriez:
Recognizing Thousands of Legal Entities through Instance-based Visual Classification. 1029-1032 - Evlampios Apostolidis, Vasileios Mezaris, Mathilde Sahuguet, Benoit Huet, Barbora Cervenková, Daniel Stein, Stefan Eickeler, José Luis Redondo García, Raphaël Troncy, Lukás Pikora:
Automatic fine-grained hyperlinking of videos within a closed collection using scene segmentation. 1033-1036 - Rui Hu, Carlos Pallan Gayol, Guido Krempel, Jean-Marc Odobez, Daniel Gatica-Perez:
Automatic Maya hieroglyph retrieval using shape and context information. 1037-1040 - Justin Salamon, Christopher Jacoby, Juan Pablo Bello:
A Dataset and Taxonomy for Urban Sound Research. 1041-1044 - Lin Chen, Peng Zhang, Baoxin Li:
Instructive Video Retrieval Based on Hybrid Ranking and Attribute Learning: A Case Study on Surgical Skill Training. 1045-1048 - Shu Shi, John W. Barrus:
A Real-Time Smart Display Detection System. 1049-1052 - Shuang Ma, Yangyu Fan, Chang Wen Chen:
Pose Maker: A Pose Recommendation System for Person in the Landscape Photographing. 1053-1056 - Matthew Prockup, Jeffrey J. Scott, Youngmoo E. Kim:
Representing Musical Patterns via the Rhythmic Style Histogram Feature. 1057-1060 - Kolbeinn Karlsson, Wei Jiang, Dong-Qing Zhang:
Mobile Photo Album Management with Multiscale Timeline. 1061-1064 - Lonce Wyse:
Interactive Audio Web Development Workflow. 1065-1068 - Yang Liu, Yan Liu, Yu Zhao, Kien A. Hua:
What Strikes the Strings of Your Heart?: Multi-Label Dimensionality Reduction for Music Emotion Analysis. 1069-1072 - Bing Xu, Xiaogang Wang, Xiaoou Tang:
Fusing Music and Video Modalities Using Multi-timescale Shared Representations. 1073-1076
Posters 3
- Yang Zhou, Weiyao Lin, Hang Su, Jianxin Wu, Jinjun Wang, Yu Zhou:
Representing And Recognizing Motion Trajectories: A Tube And Droplet Approach. 1077-1080 - Huizhong Chen, Matthew Cooper, Dhiraj Joshi, Bernd Girod:
Multi-modal Language Models for Lecture Video Retrieval. 1081-1084 - Hokuto Kagaya, Kiyoharu Aizawa, Makoto Ogawa:
Food Detection and Recognition Using Convolutional Neural Network. 1085-1088 - Kang Zhao, Hongtao Lu, Yangcheng He, Shaokun Feng:
Locality Preserving Discriminative Hashing. 1089-1092 - Xiaochun Cao, Xingxing Wei, Xiaojie Guo, Yahong Han, Jinhui Tang:
Augmented Image Retrieval using Multi-order Object Layout with Attributes. 1093-1096 - Klaus Schoeffmann:
The Stack-of-Rings Interface for Large-Scale Image Browsing on Mobile Touch Devices. 1097-1100 - Fabio Celli, Elia Bruni, Bruno Lepri:
Automatic Personality and Interaction Style Recognition from Facebook Profile Pictures. 1101-1104 - Chen Fang, Zhe Lin, Radomír Mech, Xiaohui Shen:
Automatic Image Cropping using Visual Composition, Boundary Simplicity and Content Preservation Models. 1105-1108 - Kezhen Teng, Jinqiao Wang, Min Xu, Hanqing Lu:
Mask Assisted Object Coding with Deep Learning for Object Retrieval in Surveillance Videos. 1109-1112 - Huiying Liu, Min Xu, Xiangjian He, Jinqiao Wang:
Estimate Gaze Density by Incorporating Emotion. 1113-1116 - Chun-Chieh Hsu, Hua-Tsung Chen, Chien-Li Chou, Chien-Peng Ho, Suh-Yin Lee:
Trajectory Based Jump Pattern Recognition in Broadcast Volleyball Videos. 1117-1120 - Masakazu Iwamura, Nobuaki Matozaki, Koichi Kise:
Fast Instance Search Based on Approximate Bichromatic Reverse Nearest Neighbor Search. 1121-1124 - Xufang Pang, Ying Cao, Rynson W. H. Lau, Antoni B. Chan:
A Robust Panel Extraction Method for Manga. 1125-1128 - Song Wu, Michael S. Lew:
RIFF: Retina-inspired Invariant Fast Feature Descriptor. 1129-1132 - Sabrina Schulte, Chien-Nan (Shannon) Chen, Klara Nahrstedt:
Stevens' Power Law in 3D Tele-immersion: Towards Subjective Modeling of Multimodal Cyber Interaction. 1133-1136 - Zhiqiang Zuo, Yong Luo, Dacheng Tao, Chao Xu:
Multi-view Multi-task Feature Extraction for Web Image Classification. 1137-1140 - Shanmin Pang, Jianru Xue, Zhanning Gao, Qi Tian:
Image Re-ranking with an Alternating Optimization. 1141-1144 - Edgar Roman-Rangel, Stéphane Marchand-Maillet:
Automatic Removal of Visual Stop-Words. 1145-1148 - Sho Inaba, Asako Kanezaki, Tatsuya Harada:
Automatic Image Synthesis from Keywords Using Scene Context. 1149-1152 - Noura Al Moubayed, Yolanda Vazquez-Alvarez, Alex McKay, Alessandro Vinciarelli:
Face-Based Automatic Personality Perception. 1153-1156 - Chia-Hung Lin, Yan-Ying Chen, Bor-Chun Chen, Yu-Lin Hou, Winston H. Hsu:
Facial Attribute Space Compression by Latent Human Topic Discovery. 1157-1160 - Mohammad Soleymani, Anna Aljanaki, Yi-Hsuan Yang, Michael N. Caro, Florian Eyben, Konstantin Markov, Björn W. Schuller, Remco C. Veltkamp, Felix Weninger, Frans Wiering:
Emotional Analysis of Music: A Comparison of Methods. 1161-1164 - Masaru Mizuochi, Asako Kanezaki, Tatsuya Harada:
Clothing Retrieval Based on Local Similarity with Multiple Images. 1165-1168 - Markus Koskela, Jorma Laaksonen:
Convolutional Network Features for Scene Recognition. 1169-1172 - Andrew Hines, Eoin Gillen, Damien Kelly, Jan Skoglund, Anil C. Kokaram, Naomi Harte:
Perceived Audio Quality for Streaming Stereo Music. 1173-1176 - Christos Georgakis, Stavros Petridis, Maja Pantic:
Discriminating Native from Non-Native Speech Using Fusion of Visual Cues. 1177-1180 - Guoyu Lan, Heng Qi, Keqiu Li, Kai Lin, Wenyu Qu, Zhiyang Li:
A Framework of Mobile Visual Search Based on the Weighted Matching of Dominant Descriptor. 1181-1184 - Che-Chun Lee, Yin-Hsi Kuo, Winston H. Hsu, Shin'ichi Satoh, Sebastian Agethen:
Efficient Cross-Domain Image Retrieval by Multi-Level Matching and Spatial Verification for Structural Similarity. 1185-1188 - Emre Yilmaz, Konstantinos Rematas, Tinne Tuytelaars, Hugo Van hamme:
Learning Like a Toddler: Watching Television Series to Learn Vocabulary from Images and Audio. 1189-1192 - Yu Bao, Jing Yang, Liangliang Cao, Haojie Li, Jinhui Tang:
Cuteness Recognition and Localization in the Photos of Animals. 1193-1196 - Jun Wu, Wenjing Qiao, Cailiang Kuang, Zhenbao Liu, Shuhui Bu, Junwei Han:
A 3D Fingertips Detecting and Tracking Algorithm based on the Sliding Window. 1197-1200 - Dubravko Culibrk, Nicu Sebe:
Temporal Dropout of Changes Approach to Convolutional Learning of Spatio-Temporal Features. 1201-1204 - Lorenz Kellerer, Vamsidhar Reddy Gaddam, Ragnar Langseth, Håkon Kvale Stensland, Carsten Griwodz, Dag Johansen, Pål Halvorsen:
Real-Time HDR Panorama Video. 1205-1208 - Yunhua Deng, Siqi Shen, Zhe Huang, Alexandru Iosup, Rynson W. H. Lau:
Dynamic Resource Management in Cloud-based Distributed Virtual Environments. 1209-1212 - Masayuki Furukawa, Yasuhiro Akagi, Yukiko Kawai, Hiroshi Kawasaki:
Interactive 3D Animation Creation and Viewing System based on Motion Graph and Pose Estimation Method. 1213-1216 - Kentaro Yamada, Hiroshi Sankoh, Sei Naito:
Color Transfer based on Spatial Structure for Telepresence. 1217-1220 - Zhen-Peng Bian, Junhui Hou, Lap-Pui Chau, Nadia Magnenat-Thalmann:
Human Computer Interface for Quadriplegic People Based on Face Position/gesture Detection. 1221-1224 - Lasse Farnung Laursen, Masataka Goto, Takeo Igarashi:
A Multi-Touch DJ Interface with Remote Audience Feedback. 1225-1228
Tutorials
- Wanmin Wu, Cha Zhang:
Immersive 3D Communication. 1229-1230 - Christian Timmerer, Ali C. Begen:
Over-the-Top Content Delivery: State of the Art and Challenges Ahead. 1231-1232 - Yi Yu, Kiyoharu Aizawa, Toshihiko Yamasaki, Roger Zimmermann:
Emerging Topics on Personalized and Localized Multimedia Information Systems. 1233-1234 - Lexing Xie, Haixun Wang:
Learning Knowledge Bases for Text and Multimedia. 1235-1236 - Peng Cui, Lexing Xie, Jitao Sang, Changsheng Xu:
Social multimedia computing. 1237-1238 - Vasileios Mezaris, Benoit Huet:
Video hyperlinking. 1239-1240 - David A. Shamma, Daragh Byrne:
An Introduction to Arts and Digital Culture Inside Multimedia. 1241-1242
Workshop Summaries
- Michel F. Valstar, Björn W. Schuller, Jarek Krajewski, Roddy Cowie, Maja Pantic:
AVEC 2014: the 4th international audio/visual emotion challenge and workshop. 1243-1244 - Fabio Celli, Bruno Lepri, Joan-Isaac Biel, Daniel Gatica-Perez, Giuseppe Riccardi, Fabio Pianesi:
The Workshop on Computational Personality Recognition 2014. 1245-1246 - Judith A. Redi, Mathias Lux:
CrowdMM14 - 2014 International ACM Workshop on Crowdsourcing for Multimedia. 1247-1248 - M. Anwar Hossain, Abdulmotaleb El-Saddik:
EMASC14: 1st International Workshop on Emerging Multimedia Applications and Services for Smart Cities. 1249-1250 - Liangliang Cao, Gerald Friedland, Lexing Xie:
GeoMM 2014: the third ACM multimedia workshop ongeotagging and its applications in multimedia. 1251-1252 - Ansgar Scherp, Vasileios Mezaris, Bogdan Ionescu, Francesco G. B. De Natale:
HuEvent'14: 2014 workshop on human-centered event understanding from multimedia. 1253-1254 - Teresa Chambel, Paula Viana, V. Michael Bove Jr., Sharon Strover, Graham Thomas:
ImmersiveMe'14: 2nd ACM international workshop on immersive media experiences. 1255-1256 - Roger Zimmermann, Yi Yu:
WISMM'14 - First ACM International Workshop on Internet-Scale Multimedia Management. 1257-1258 - Pablo César, David A. Shamma, Matthew Cooper, Aisling Kelliher:
3rd International Workshop on Socially-Aware Multimedia (SAM'14). 1259-1260 - Concetto Spampinato, Vasileios Mezaris, Marco Cristani:
Summary Abstract for the 3rd ACM International Workshop on Multimedia Analysis for Ecological Data. 1261-1262 - Hari Kalva, Homer H. Chen, Gerardo Fernández-Escribano, Velibor Adzic:
PIVP 2014: First International Workshop on Perception Inspired Video Processing. 1263-1264 - Wolfgang Effelsberg, Stefan Göbel:
Serious Games 2014: International Workshop on Serious Games. 1265-1266
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.