Workshops
Monday, July 8, 2019
W-01: Multimedia Services and Technologies for Smart-health (MUST-SH)
Time: 8:30 AM - 17:00 PM
Room: 5F
Organizers: Shamim Hossain King Saud University, Saudi Arabia
Stefan Goebel KOM, TU Darmstadt, Germany
Yin Zhang Zhongnan University of Economics and Law, China
8:30 - 8:35 Opening Remarks:
Yin Zhang Zhongnan University of Economics and Law, China
8:35 - 9:30 Keynote Talk:
Huimin Lu Kyushu Institute of Technology, Japan
9:30 - 10:00 Oral Session 1:
Session Chair: Shamim Hossain King Saud University, Saudi Arabia
FULLY CONVOLUTIONAL NETWORK FOR 3D HUMAN SKELETON ESTIMATION FROM A SINGLE VIEW FOR ACTION ANALYSIS
Wen-Nung Lie1, Guan-Han Lin1, Lung-Sheng Shih1, YuLing Hsu1, Thang Huu Nguyen2, Quynh Nguyen Quang Nhu2
1National Chung Cheng University, Taiwan, 2The University of Danang, University of Science and Technology, Vietnam
10:00 - 10:30 Coffee Break
10:30 - 12:00 Oral Session 2:
Session Chair: Stefan Goebel KOM, TU Darmstadt, Germany
10:30 - 11:00
ATTENTION BASED SEMI-SUPERVISED DICTIONARY LEARNING FOR DIAGNOSIS OF AUTISM SPECTRUM DISORDERS
Meng Yang1,2, Qin Zhong1, Lin Chen3, Fanglin Huang4, Baiying Lei4
1Sun Yat-sen University, Guangzhou, China, 2Key Laboratory of Machine Intelligence and Advanced Computing (SYSU), Ministry of Education, 3Sogou, China, 4Shenzhen University, China
11:00 - 11:30
RT-ADI: FAST REAL-TIME VIDEO REPRESENTATION FOR MULTI-VIEW HUMAN FALL DETECTION
Qianggang Ding, Fan Yang, Jiawei Li, Sifan Wu, Bowen Zhao, Zhi Wang, Shutao Xia
Tsinghua University, China
11:30 - 12:00
A NEW IMAGE WATERMARKING SCHEME FOR EFFICIENT TAMPER DETECTION, LOCALIZATION AND RECOVERY
Faranak Tohidi, Manoranjan Paul
Charles Sturt University, Australia
12:00 - 13:30 Lunch Break
13:30 - 15:00 Oral Session 3:
Session Chair: Yin Zhang Zhongnan University of Economics and Law, China
13:30 - 14:00
PREDICTING HUMAN GRASP LOCATIONS ON CUP HANDLES BY USING DEEP NEURAL NETWORKS TO INFER HEAT SIGNATURES FROM DEPTH DATA
Yijun Jiang, Sean Banerjee, Natasha Kholgade Banerjee
Clarkson University, USA
14:00 - 14:30
HIERARCHICAL FUZZY INFERENCE SYSTEM FOR DIAGNOSING DENGUE DISEASE
Mubarak Alrashoud
King Saud University, Saudi Arabia
14:30 - 15:00
HUMAN-INTERACTION WEAKLY-SUPERVISED DEEP NETWORKS FOR SEMANTIC SEGMENTATION
Wenfeng Luo1, Meng Yang1,2
1Sun Yat-sen University, China, 2Key Laboratory of Machine Intelligence and Advanced Computing (SYSU), Ministry of Education, China
15:00 - 15:30 Coffee Break
15:30 - 17:00 Oral Session 4:
Session Chair: Shamim Hossain King Saud University, Saudi Arabia
15:30 - 16:15
THE PREDICTION MODEL OF BLOOD GLUCOSE CONCENTRATION FOR SMART HEALTH
Han Yu, Jianmin Lu, Yue Jin, Binglei Yue, Xiao Ma
Zhongnan University of Economics and Law, China
16:15 - 17:00
PREDICTING SPINE SURGERY COMPLICATIONS USING MACHINE LEARNING
Mohamad Hoda1, Abdulmotaleb El Saddik1, Eugene Wai2, Philippe Phan3
1University of Ottawa, Canada, 2The Ottawa Hospital, Canada, 3The Ottawa Hospital, Canada
W-02: International Joint Workshop on Multimedia Artworks Analysis and Attractiveness Computing in Multimedia (MMArt-ACM)
Time: 8:30 AM - 12:00 PM
Room: 5H
Organizers: Wei-Ta Chu National Chung Cheng University, Taiwan
Norimichi Tsumura Graduate School of Engineering, Chiba University, Japan
Shoji Yamamoto Tokyo Metropolitan College of Industrial Technology, Japan
Toshihiko Yamasaki University of Tokyo, Japan
8:30 - 8:35 Opening Remarks:
Session Chair: Toshihiko Yamasaki
8:35 - 9:50 Oral Session 1: Multimedia Artworks Analysis
Session Chair: Norimichi Tsumura, Toshihiko Yamasaki
8:35 - 8:50
DEEPIR: A DEEP SEMANTICS DRIVEN FRAMEWORK FOR IMAGE RETARGETING
Jianxin Lin, Tiankuang Zhou, Zhibo Chen
University of Science and Technology of China, China
8:50 - 9:05
MULTI-DEPTH DILATED NETWORK FOR FASHION LANDMARK DETECTION
Zeng Kai, Jun Feng, Richard F E Sutcliffe, Wang Xiaoyu, Bu Qirong
Northwest University, China
9:05 - 9:20
SALIENCY-GUIDED IMAGE STYLE TRANSFER
Xiuwen Liu, Zhi Liu, Xiaofei Zhou, Minyu Chen
Shanghai University, China
9:20 - 9:35
A MULTIMEDIA-BASED MOVIE STYLE MODEL
Priyankar Choudhary, Neeraj Goel, Mukesh Saini
Indian Institute of Technology Ropar, India
9:35 - 9:50
NEURAL STYLE TRANSFER WITH CONTENT DISCRIMINATION
Xiyu Yan, Yeli Xing, Zihao He, Tao Dai, Yong Jiang, Shutao Xia
Tsinghua University, China
10:00 - 10:30 Coffee Break
10:30 - 11:30 Keynote talk by Prof. Jia Jia
Session Chair: Toshihiko Yamasaki
11:30 - 12:00 Oral Session 2: Attractiveness Computing in Multimedia
Session Chair: Wei-Ta Chu
11:30 - 11:45
PREDICTING THE ATTRACTIVENESS OF REAL-ESTATE IMAGES BY PAIRWISE COMPARISON USING DEEP LEARNING
Xueting Wang, Yuki Takada, Youiti Kado, Toshihiko Yamasaki
The University of Tokyo, Japan
11:45 - 12:00
VIDEO-BASED STRESS LEVEL MEASUREMENT USING IMAGING PHOTOPLETHYSMOGRAPHY
Ryota Mitsuhashi1, Kaito Iuchi1, Takashi Goto2, Akira Matsubara2, Takahiro Hirayama2, Hideki Hashizume2, Norimichi Tsumura1
1Chiba University, Japan, 2Daikin Industries LTD, Japan
W-03: Visual Emotion Analysis: Theories and Applications
Time: 13:30 - 17:30 PM
Room: 5H
Organizers: Lifang Wu Beijing University of Technology, China
Jufeng Yang Nankai University, China
Rongrong Ji Xiamen University, China
13:30 - 13:35 Opening Remarks
13:35 - 14:30 Keynote: Computation of Emotion (Jiebo Luo)
14:30 - 15:00 Invited Talk 1: Affective and aesthetic computing on social images (Jia Jia)
15:00 - 15:30 Coffee Break
15:30 - 16:00 Invited Talk 2: Visual sentiment analysis and beyond (Yanwei Fu)
16:00 - 16:30 Invited Talk 3: Weakly supervised coupled networks for visual sentiment analysis (Dongyu She)
16:30 - 16:50
FEAFA: A WELL-ANNOTATED DATABASE FOR FACIAL EXPRESSION ANALYSIS AND 3D FACIAL ANIMATION
Yanfu Yan1, Ke Lu1, Jian Xue1, Pengcheng Gao1, Jiayi Lyu2
1University of Chinese Academy of Sciences, China 2Capital Normal University, China
16:50 - 17:10
CROSS-DATABASE MICRO-EXPRESSION RECOGNITION: A STYLE AGGREGATED AND ATTENTION TRANSFER APPROACH
Ling Zhou, Qirong Mao, Luoyang Xue
Jiangsu University, China
17:10 - 17:30
THE FUSION KNOWLEDGE OF FACE, BODY AND CONTEXT FOR EMOTION RECOGNITION
Jingjing Wu, Yong Zhang, Li Ning
Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, China
W-04: 1st International Workshop on Big Surveillance Data Analysis and Processing
Time: 8:30 AM - 12:00 PM
Room: 5I
Organizers: Weiyao Lin Shanghai Jiao Tong University, China
John See Multimedia University, Malaysia
Michael Ying Yang University of Twente, the Netherlands
8:30 - 10:00 Oral Session 1: Object Motion Analysis in Big Surveillance Videos
Session Chair: Weiyao Lin, Michael Ying Yang
8:30 - 8:45
DEFORMATION SAMPLE GENERATED NETWORK FOR ROBUST VISUAL TRACKING
Zizi Li, Yuan Zhou, Chunping Hou
Tianjin University, China
8:45 - 9:00
PRESERVING STRUCTURAL RELATIONSHIPS FOR PERSON RE-IDENTIFICATION
Liqiang Bao1, Bingpeng Ma1, Hong Chang2, Xilin Chen2
1University of Chinese Academy of Sciences, China 2Chinese Academy of Sciences, China
9:00 - 9:15
ADAPTIVE UPDATING SIAMESE NETWORK WITH LIKE-HOOD ESTIMATION FOR SURVEILLANCE VIDEO OBJECT TRACKING
Zhenxian Zheng, Yang Yi, Jinlong Shen, Jiahao Zhang
Sun Yat-sen University, China
9:15 - 9:30
A MULTIMODAL LOSSLESS CODING METHOD FOR SKELETONS IN VIDEOS
Xiaoyi He, Mingzhou Liu, Weiyao Lin, Xintong Han, Yanmin Zhu, Hongtao Lu, Hongkai Xiong
Shanghai Jiao Tong University, China
9:30 - 9:45
EFFICIENT SEMANTIC-BASED VEHICLE RETRIEVAL IN LONG-TERM CAR PARK VIDEOS
Clarence Weihan Cheong, Ryan Woei-Sheng Lim, John See, Lai-Kuan Wong, Ian Kim Teck Tan
Multimedia University, Malaysia
9:45 - 10:00
SINGLE IMAGE HAZE REMOVAL BY FEATURE MAPPING
Feiniu Yuan1, Yu Zhou2, Xue Xia2, Ya Li2
1Shanghai Normal University, China, 2Jiangxi University of Finance and Economics, China
10:00 - 10:30 Coffee Break
10:30 - 12:00 Oral Session 2: Human & Action Sensing for Big Surveillance Videos
Session Chair: Weiyao Lin, Michael Ying Yang
10:30 - 10:45
MOTION-LET CLUSTERING FOR SKELETON-BASED ACTION RECOGNITION
Jianyu Yang1, Chen Zhu1, Junsong Yuan2
1Soochow University, China, 2State University of New York at Buffalo, USA
10:45 - 11:00
DEEP KEY CLIPS-VIDEO FEATURE FUSION FRAMEWORK FOR ACTION RECOGNITION
Chao Li1, Yue Ming1, Yuan Shen2, Hui Yu3
1Beijing University of Posts and Telecommunications, China 2Tencent Technology (Beijing) Co., Ltd, China
3University of Portsmouth, UK
11:00 - 11:15
HUMAN IDENTIFICATION RECOGNITION IN SURVEILLANCE VIDEOS
Kai Jin, Xuemei Xie, Fangyu Wang, Xiao Han, Guangming Shi
Xidian University, China
11:15 - 11:30
AGE ESTIMATION FOR LOW-QUALITY FACIAL IMAGES: FROM SEPARATE DCNNS TO A DECISION FUSER
Kuan-Hsien Liu1, Pak Ki Chan2, Tsung-Jung Liu3, Hsiu-An Her1
1National Taichung University of Science and Technology, Taiwan, 2China Medical University Hospital, China
3National Chung Hsing University, Taiwan
11:30 - 11:45
SEMANTIC SEGMENTATION OF SATELLITE IMAGES USING A U-SHAPED FULLY CONNECTED NETWORK WITH DENSE RESIDUAL BLOCKS
Eric R Narciso Molina, Zenghui Zhang
Shanghai Jiao Tong University, China
11:45 - 12:00
MTCNN WITH WEIGHTED LOSS PENALTY AND ADAPTIVE THRESHOLD LEARNING FOR FACIAL ATTRIBUTE PREDICTION
Xingting He, Pingyu Wang, Zhicheng Zhao, Yanyun Zhao, Fei Su
Beijing University of Posts and Telecommunications, China
W-05: Multimedia for Robot, Unmanned Aerial Vehicle and Driverless Car
Time: 13:30 - 17:00 PM
Room: 5I
Organizers: Dong Zhao Beijing University of Posts and Telecommunications, China
Chenqiang Gao Chongqing University of Posts and Telecommunications, China
Jiayi Ma Wuhan University, China
Quan Zhou Nanjing University of Posts and Telecommunications, China
Ji Zhao TuSimple, China
Yu Zhou Beijing University of Posts and Telecommunications, China
13:30 - 13:35 Opening Remarks:
Yu Zhou Huazhong University of Science and Technology, China
13:35 - 14:10 Keynote Talk:
Yiqun Li Huazhong University of Science and Technology, China
14:10 - 14:45 Keynote Talk:
Chen Chen University of North Carolina at Charlotte, USA
14:45 - 15:05 Oral Session 1:
Session Chair: Dong Zhao
14:45 - 15:05
MULTI-PATH FUSION NETWORK FOR HIGH-RESOLUTION HEIGHT ESTIMATION FROM A SINGLE ORTHOPHOTO
Yiteng Zhang, Xuejin Chen
University of Science and Technology of China, China
15:05 - 15:25 Coffee Break
15:25 - 16:00 Keynote Talk:
Lin Zhang Tongji University, China
16:00 - 17:00 Oral Session 2:
Session Chair: Jiayi Ma
16:00 - 16:20
FACE ANTI-SPOOFING BASED ON MULTI-LAYER DOMAIN ADAPTATION
Fengshun Zhou1,2, Chenqiang Gao1,2, Fang Chen1,2, Chaoyu Li1,2, Xindou Li1,2, Feng Yang1,2, Yue Zhao1,2
1Chongqing University of Posts and Telecommunications, Chongqing, China, 2Chongqing Key Laboratory of Signal and Information Processing, Chongqing 400065, China
16:20 - 16:40
SELF-ATTENTION RELATION NETWORK FOR FEW-SHOT LEARNING
Binyuan Hui, Pengfei Zhu, Qinghua Hu, Qilong Wang
Tianjin University, China
16:40 - 17:00
BISE-RESNET: COMBINE SEGMENTATION AND CLASSIFICATION NETWORKS FOR ROAD FOLLOWING ON UNMANNED AERIAL VEHICLE
Dian Lyu, Peng Cheng, Ruizhou Liu, Liang Liu
Beijing University of Posts and Telecommunications, China
W-06: Information Theory and Multimedia Computing (ITMC)
Time: 8:30 AM - 16:30 PM
Room: 5J
Organizers: Ran He Chinese Academy of Sciences, China
Xiaotong Yuan Nanjing University, China
Jitao Sang Beijing Jiaotong University, China
8:50 - 9:00 Opening
9:00 - 10:00 Keynote Talk: Ran He
10:00 - 10:15 Coffee Break
10:15 - 11:45 Oral Session 1:
Session Chair: Ran He
10:15 – 10:30
HYBRID DEFENSE FOR DEEP NEURAL NETWORKS: AN INTEGRATION OF DETECTING AND CLEANING ADVERSARIAL PERTURBATIONS
Weiqi Fan, Guangling Sun, Yuying Su, Zhi Liu, Xiaofeng Lu
Shanghai University, China
10:30 – 10:45
SKETCH-BASED IMAGE RETRIEVAL VIA A SEMI-HETEROGENEOUS CROSS-DOMAIN NETWORK
Chuo Li, Yuan Zhou, Jianxing Yang
Tianjin University, Tianjin, China
10:45 – 11:00
QUESTION SPLITTING AND UNBALANCED MULTI-MODAL POOLING FOR VQA
Mengfei Li, Huan Shao, Yi Ji, Yang Yang, ChunPing Liu
Soochow University, Suzhou, Jiangsu, China
11:00 – 11:15
AI-GAN: SIGNAL DE-INTERFERENCE VIA ASYNCHRONOUS INTERACTIVE GENERATIVE ADVERSARIAL NETWORK
Xin Jin, Zhibo Chen, Jianxin Lin, Wei Zhou, Jiale Chen, Chaowei Shan
University of Science and Technology of China, Hefei, China
11:15 – 11:30
VISUAL OBJECT TRACKING VIA GRAPH CONVOLUTIONAL REPRESENTATION
Zhengzheng Tu, Ajian Zhou, Bo Jiang, Bin Luo
Anhui University, China
11:30 – 11:45
MOIRE PATTERN REMOVAL WITH MULTI-SCALE FEATURE ENHANCING NETWORK
Tian yu Gao1, Yanqing Guo1, Xin Zheng1, Qianyu Wang1, Xiangyang Luo2
1Dalian University of Technology, China 2The State Key Laboratory of Mathematical Engineering and Advanced Computing, China
12:00 - 13:30 Lunch Break
13:30 - 15:00 Oral Session 2:
Session Chair: Yi Li
13:30 – 13:45
DEEP COLOR IMAGE DEMOSAICKING WITH FEATURE PYRAMID CHANNEL ATTENTION.
Qi Kang, Ying Fu, Hua Huang
Beijing Institute of Technology, China
13:45 – 14:00
REAL-WORLD IMAGE DENOISING VIA WEIGHTED LOW RANK APPROXIMATION.
Yuenan Guo, Ying Fu, Hua Huang
Beijing Institute of Technology, China
14:00 – 14:15
TWO-STREAM SPARSE NETWORK FOR ACCURATE IMAGE SUPER-RESOLUTION.
Ling Hu1,2, Shuhui Wang1, Liang Li1, Qingming Huang1,2
1Key Lab of Intell. Info. Process., Inst. of Comput. Tech., CAS, China, 2University of Chinese Academy of Sciences, Beijing, 100049, China
14:15 – 14:30
EMBEDDING NON-LOCAL MEAN IN SQUEEZE-AND-EXCITATION NETWORK FOR SINGLE IMAGE DERAINING.
Cong Wang, Hongyan Wang, Zhixun Su, Yan Yang
Dalian University of Technology, China
14:30 – 14:45
RELATIVE DEPTH ESTIMATION PRIOR FOR SINGLE IMAGE DEHAZING.
Jinbao Wang1, Ke Lu1, Jian Xue1, Yutong Kou2
1University of Chinese Academy of Sciences, China 2Huazhong University of Science & Technology, China
14:45 – 15:00
LOW-LIGHT IMAGE ENHANCEMENT WITH ATTENTION AND MULTI-LEVEL FEATURE FUSION.
Lei Wang1, Guangtao Fu2, Zhuqing Jiang1, Guodong Ju3, Aidong Men1
1Beijing University of Posts and Telecommunications, China, 2Academy of Broadcasting Science, China,
3GuangDong TUS-TuWei Technology Co, Ltd, China
15:00 - 15:30 Coffee Break
15:30 - 16:30 Oral Session 3:
Session Chair: Yi Li
15:30 – 15:45
BLIND MESH QUALITY ASSESSMENT METHOD BASED ON CONCAVE, CONVEX AND STRUCTURAL FEATURES ANALYSES.
Yaoyao Lin, Mei Yu, Ken Chen, Gangyi Jiang, Zongju Peng, Fen Chen
Faculty of Information Science and Engineering, Ningbo University, Ningbo, China
15:45 – 16:00
K-COVERS FOR ACTIVE LEARNING IN IMAGE CLASSIFICATION.
Yeji Shen1, Yuhang Song1, Hanhan Li2, Shahab Kamali2, Bin Wang1, C.-C. Jay Kuo1
1University of Southern California, USA, 2Google Research, USA
16:00 – 16:15
DISTRIBUTION DISCREPANCY MAXIMIZATION FOR IMAGE PRIVACY PRESERVING.
Sen Liu, Jianxin Lin, Zhibo Chen
University of Science and Technology of China, China
16:15 – 16:30
A NOVEL DISTANCE LEARNING FOR ELASTIC CROSS MODAL AUDIO-VISUAL MATCHING.
Rui Wang1, Huaibo Huang2,3, Xufeng Zhang1, Jixin Ma4, Aihua Zheng1
1Anhui University, China, 2University of Chinese Academy of Sciences, China, 3CASIA, China, 4University of Greenwich, UK
W-07: 6th IEEE International Workshop on Mobile Multimedia Computing (MMC)
Time: 8:30 AM - 12:00 PM
Room: 5F
Organizers: Tian Gan Shandong University, China
Wen-Huang Cheng National Chiao Tung University, Taiwan
Kai-Lung Hua National Taiwan University of Science and Technology, Taiwan
Klaus Schoeffmann Klagenfurt University, Austria
Vladan Velisavljevic University of Bedfordshire, UK
Christian von der Weth National University of Singapore, Singapore
8:30 - 9:00 Opening & Keynotes
9:00 - 10:00 Oral Session 1:
Session Chair: Wen-Huang Cheng
9:00 - 09:15
FINE DETECTION AND CLASSIFICATION OF MULTI-CLASS BARCODE IN COMPLEX ENVIRONMENTS
Jiahe Zhang1, Jun Jia1, Zehao Zhu1, Xiongkuo Min1, Guangtao Zhai1, Xiao-Ping Zhang2
1Shanghai Jiao Tong University, China, 2Ryerson University, Canada
9:15 - 09:30
DEEP LEARNING BASED METHOD FOR PRUNING DEEP NEURAL NETWORKS
Lianqiang Li1, Jie Zhu1, Ming-Ting Sun2
1Shanghai Jiao Tong University, China, 2University of Washington, USA
9:30 - 09:45
ALPS 1.0: TOWARDS AUTOMATED LECTURE PROFILING SYSTEM
Pratibha Kumari1, Prakhar Jain1, Swarna Sahay1, Gan Tian2, Mukesh Saini1
1Indian Institute of Technology Ropar, India, 2Shandong University, China
9:45 - 10:00
VAS360: QOE-DRIVEN VIEWPORT ADAPTIVE STREAMING FOR 360 VIDEO
Yuxiang Hu, Yu Liu, Yumei Wang
Beijing University of Posts and Telecommunications, China
10:00 - 10:30 Coffee Break
10:30 - 11:30 Oral Session 2:
Session Chair: Tian Gan
10:30 - 10:45
FUSING GEOGRAPHIC INFORMATION INTO LATENT FACTOR MODEL FOR PICK-UP REGION RECOMMENDATION
Zhuhua Liao, Jian Zhang, Yizhi Liu
Hunan University of Science & Technology, China
10:45 - 11:00
A FLEXIBLE VIEWPORT-ADAPTIVE PROCESSING MECHANISM FOR REAL-TIME VR VIDEO TRANSMISSION
Anyue Xu, Xinyu Chen, Yu Liu, Yumei Wang
Beijing University of Posts and Telecommunications, China
11:00 - 11:15
OBJECTIVE QUALITY ASSESSMENT METHOD FOR STEREOSCOPIC IMAGE RETARGETING
Salah Addin Mohammed M Mohammed, Ya Zhou, Zhibo Chen, Houqiang Li
University of Science and Technology of China, China
11:15 - 11:30
OPTIMAL MULTI-CODEC ADAPTIVE BITRATE STREAMING
Yuriy Reznik, Xiangbo Li, Karl Lillevold, Abhijith Jagannath, Justin Greer
Brightcove Inc., USA
11:30 - 12:00
Best Paper Award Announcement
W-08: Time-sequenced Multimedia Computing
Time: 13:30 - 17:45 PM
Room: 5F
Organizers: Wei Li Fudan University, China
Mengyao Zhu Shanghai University, China
Bing-Kun Bao Nanjing University of Posts and Telecommunications, Nanjing, China
Min Xu University of Technology Sydney, Australia
Xi Shao Nanjing University of Posts and Telecommunications, Nanjing, China
13:30 - 13:55
AUDIO SCENE CLASSIFICATION WITH DISCRIMINATIVELY-TRAINED SEGMENT-LEVEL FEATURES
Haichuan Bai1,2, Hangting Chen1,2, Yonghong Yan1,2
1Chinese Academy of Sciences, China, 2University of Chinese Academy of Sciences, China
13:55 - 14:20
EFFICIENT IMPLICIT FOURIER COMPRESSION BASED CONVOLUTIONAL FEATURES FOR VISUAL TRACKING
Ridong Zhu, Xiaoyuan Yang, Jingkai Wang, Zhengze Li
Beihang University, China
14:20 - 14:45
AUDIO2FACE: GENERATING SPEECH/FACE ANIMATION FROM SINGLE AUDIO WITH ATTENTION-BASED BIDIRECTIONAL LSTM NETWORKS
Guanzhong Tian1, Yi Yuan2, Yong Liu1
1Zhejiang University, China, 2Fuxi AI Lab, Netease, China
14:45 - 15:10
DEEP VOCODER: LOW BIT RATE COMPRESSION OF SPEECH WITH DEEP AUTOENCODER
Gang Min1, Changqing Zhang1, Xiongwei Zhang2, Wei Tan1
1National University of Defense Technology, China 2Army Engineering University of PLA, China
15:10 - 15:30 Coffee Break
15:30 - 15:55
BLIND ESTIMATION OF REVERBERATION TIME USING BINAURAL COMPLEX IDEAL RATIO MASK
MingYang Chai1, TianTian Li1, MengYao Zhu1, Tao Wang1, Wen Zhang2
1Shanghai University, China, 2Northwestern Polytechnical University, China
15:55 - 16:20
OPV: BIAS CORRECTION BASED OPTIMAL PROBABILISTIC VIEWPORT-ADAPTIVE STREAMING FOR 360-DEGREE VIDEO
Weihong Lin, Xinggong Zhang, Zongming Guo, Wei Hu
Peking University, China
16:20 - 16:45
SVD-BASED CHANNEL PRUNING FOR CONVOLUTIONAL NEURAL NETWORK IN ACOUSTIC SCENE CLASSIFICATION MODEL
Jun Wang1, Shengchen Li1, Wenwu Wang2
1Beijing University of Posts and Telecommunications, China, 2University of Surrey, UK
16:45 - 17:10
MULTI-LEVEL ATTENTION MODEL WITH DEEP SCATTERING SPECTRUM FOR ACOUSTIC SCENE CLASSIFICATION
Zhitong Li1, Yuanbo Hou2, Xiang Xie1,3, Shengchen Li2, Liqiang Zhang1, Shixuan Du1, Wei Liu1
1Beijing Institute of Technology, China, 2Beijing University of Posts and Telecommunications, China, 3Beijing Institute of Technology, China
17:10 - 17:45
A MULTI-CRITERIA SUBJECTIVE EVALUATION METHOD FOR BINAURAL AUDIO RENDERING TECHNIQUES IN VIRTUAL REALITY APPLICATIONS
Zhaoyu Yan, Jing Wang, Zhuoran Li
Beijing Institute of Technology, China
W-09: Smart Camera (Gigavision)
Time: 8:30 AM - 12:00 PM
Room: 5I
Organizers: Lu Fang Associate Professor, Tsinghua-Berkeley Shenzhen Institute, China
David J. Brady Duke University, USA
Shenghua Gao Assistant Professor, ShanghaiTech University, China
Yuchen Guo Tsinghua University, China
8:30 - 8:35 Opening Remarks:
Lu Fang Tsinghua University, China
8:35 - 9:15 Plenary Talk:
David J. Brady Duke University, USA
9:15 - 9:40 Keynote Talk:
Lu Fang Tsinghua University, China
9:40 - 10:05 Oral Session 1:
Session Chair: Lu Fang
SCALE-ADAPTIVE CNN BASED CROWD COUNTING AND DYNAMIC SUPERVISION
Zhengxin Li1, Jing Li1, Ling Xie1, Jianli Liu2
1ShanghaiTech University, Shanghai, China, 2Jiangnan University, Wuxi, China
SPATIAL-TEMPORAL CODEC ACCURACY CALIBRATION FOR MULTI-SCALE GIGA-PIXEL MACROSCOPE
Lei WANG, Jinli SUO, Jingtao FAN
Tsinghua University, China
10:05 - 10:20 Coffee Break
10:20 - 10:45 Keynote Talk:
Zhan Ma Nanjing University, China
10:45 - 11:10 Keynote Talk:
Shenghua Gao ShanghaiTech University, China
11:10 - 11:35 Keynote Talk:
Xing Lin Tsinghua University, China
11:35 - 12:00 Oral Session 2:
Session Chair: Lu Fang
SEGMENTATION OF BUILDING FOOTPRINTS WITH XCEPTION AND IOULOSS
Kepeng Xu1, Yunye Zhang1, Wenxin Yu1, Zhiqiang Zhang1, Jingwei Lu2, Yibo Fan3, Gang He4, Zhuo Yang5
1Southwest University of Science and Technology, China, 2Cadence Design Systems, Inc, 3Fudan University, China, 4Xidian University, China, 5Guangdong University of Technology, China
GIGAPIXEL-LEVEL IMAGE CROWD COUNTING USING CSRNET
Zhijie Cao1, Renyou Yan2, Yiyong Huang3, Zhiru Shi4
1Shanghai Jiao Tong University, China, 2ShanghaiTech University, China, 3Shanghai University, China, 4Yoke Intelligence, China
W-10: Cross-media Big Data Analysis for Semantic Knowledge Understanding
Time: 13:30 - 18:00 PM
Room: 5I
Organizers: Yang Yang University of Electronic Science and Technology of China, China.
Yang Wang Dalian University of Technology, China.
Xing Xu University of Electronic Science and Technology of China, China.
Zi Huang University of Queensland, Australia.
13:30 - 13:35 Opening Remarks
13:35 - 14:05 Keynote 1: Tentative
14:05 - 15:35 Oral Session 1: Knowledge Transfer Methods in Vision and Language
Session Chair: Yang Yang
14:05 - 14:20
MASK-GUIDED STYLE TRANSFER NETWORK FOR PURIFYING REAL IMAGES
Tongtong Zhao, Yuxiao Yan, Jinjia Peng, Huibing Wang, Xianping Fu
Dalian Maritime University, China
14:20 - 14:35
IMITATION LEARNING FOR SENTENCE GENERATION WITH DILATED CONVOLUTIONS USING ADVERSARIAL TRAINING
JianWei Peng1, MinChun Hu1, ChuanWang Chang2
1National Cheng Kung University, Taiwan, 2Kun Shan University, Taiwan
14:35 - 14:50
NON-RIGID 3D SHAPE RETRIEVAL BASED ON MULTI-VIEW METRIC LEARNING
Haohao Li, Shengfa Wang, Nannan Li, Zhixun Su, Ximin
Dalian University of Technology, China
14:50 - 15:05
WHAT TOPICS DO IMAGES SAY: A NEURAL IMAGE CAPTIONING MODEL WITH TOPIC REPRESENTATION
Feng Chen, Songxian Xie, Xinyi Li, Shasha Li, Jintao Tang, Ting Wang
National University of Defense Technology, China
15:05 - 15:30 Coffee Break
15:30 - 16:00 Keynote 2: Tentative
16:00 - 16:30 Oral Session 1: Knowledge Transfer Methods in Vision and Language
Session Chair: Yang Yang
16:00 - 16:15
CROSS DOMAIN KNOWLEDGE TRANSFER FOR UNSUPERVISED VEHICLE RE-IDENTIFICATION
Jinjia Peng, Huibing Wang, Tongtong Zhao and Xianping Fu
Dalian Maritime University, China
16:15 - 16:30
CYCLE-CONSISTENT DIVERSE IMAGE SYNTHESIS FROM NATURAL LANGUAGE
Zhi Chen, Yadan Luo
The University of Queensland, Australia
16:30 - 18:00 Session 2: Knowledge Transfer Related Application
Session chair: Yang Wang
16:30 - 16:45
SELF-WEIGHTED MULTIVIEW METRIC LEARNING BY MAXIMIZING THE CROSS CORRELATIONS
Huibing Wang, Jinjia Peng and Xianping Fu
Dalian Maritime University, China
16:45 - 17:00
CAUSATION-DRIVEN VISUALIZATIONS FOR INSURANCE RECOMMENDATION
Zhixiu Liu1, Chengxi Zang2, Kun Kuang1, Hao Zou1, Hu Zheng3, Peng Cui1
1Tsinghua University, China, 2Cornell University, USA, 3Datebao Insurance Ltd, China
17:00 - 17:15
CROSS-MODAL TRANSFER HASHING BASED ON COHERENT PROJECTION
En Yu1,2, Jiande Sun1, Li Wang1, Xiaojun Chang3, Huaxiang Zhang1, Alexander G. Hauptmann2
1Shandong Normal University, China, 2Carnegie Mellon University, USA, 3Monash University, Australia
17:15 - 17:30
17:30 - 17:45
RELATION NETWORK FOR HYPERSPECTRAL IMAGE CLASSIFICATION
Bin Deng (Shenzhen University)*; Daming Shi (College of Computer Science and Software Engineering, Shenzhen University)
Tianjin University, China
17:45 - 18:00
ANNOTATING 3D MODELS AND THEIR PARTS VIA DEEP FEATURE EMBEDDING
Kouki Omata, Takahiko Furuya, Ryutarou Ohbuchi
University of Yamanashi, Japan
W-11: AI Technology for Visual Fashion Computing
Time: 8:30 - 9:50 AM
Room: 5J
Organizers: Wei Zhang JD AI Research, China
Ting Yao JD AI Research, China
Wen-Huang Cheng National Chiao Tung University, Taiwan
8:30 - 8:35 Opening Remarks
Session Chairs: Wei Zhang JD AI Research, China
8:35 - 9:00
DISENTANGLED HUMAN ACTION VIDEO GENERATION VIA DECOUPLED LEARNING
Lingbo Yang1, Zhenghui Zhao1, Shiqi Wang2, Shanshe Wang1, Siwei Ma1, Wen Gao1
1Peking University, China, 2City University of Hong Kong, China
9:00 - 9:25
PERSONALIZED IMAGE RECOMMENDATION WITH PHOTO IMPORTANCE AND USER-ITEM INTERACTIVE ATTENTION
Wan Zhang, Zepeng Wang, Tao Chen
Hefei University of Technology, China
9:25 - 9:50
PARTIALLY OCCLUDED HEAD POSTURE ESTIMATION FOR 2D IMAGES USING PYRAMID HOG FEATURES
Jun Wu1, Z. Shang1, K. Wang1, J. Zhai1, Y. Wang1, F. Xia1, W. Li1, J. Zhang1, Fan Zhang2
1Northwestern Polytechnical University, China, 2Zhejiang University, China
Friday, July 12, 2019
W-12: 2nd IEEE International Workshop on Faces in Multimedia (FacesMM)
Time: 10:30 AM - 12:00 PM
Room: 5J
Organizers: Yun Fu Northeastern University, China
Joseph P Robinson Northeastern University, China
Ming Shao University of Massachusetts, Dartmouth
Siyu Xia Southeast University, China
10:30 - 10:35 Opening Remarks: Joseph P Robinson
10:35 - 11:15 Keynote Talk: Di Huang Beihang University, China
11:15 - 11:30
ADAPTIVE SALIENCE PRESERVING POOLING FOR DEEP CONVOLUTIONAL NEURAL NETWORKS
Yu Zhenyu1, Dai Shiyu1, Xing Yuxiang2
1Nuctech Company Limited, China, 2Tsinghua University, China
11:30 - 11:45
FULLY AUTOMATIC PHOTOREALISTIC FACIAL EXPRESSION AND EYE GAZE TRANSFER WITH A SINGLE IMAGE
Wanxin Xu, Sen-ching Cheung
University of Kentucky, USA
11:45 - 12:00
DEEP DOMAIN ADAPTATION FOR ASIAN FACE RECOGNITION VIA ADA-IBN
Chen Qian, Yi Jin, Yidong Li, Congyan Lang, Songhe Feng, Tao Wang
Beijing Jiaotong University, China
W-13: The Third Workshop on Human Identification in Multimedia (HIM)
Time: 13:30 - 17:30 PM
Room: 5J
Organizers: Liangliang Ren Department of Automation, Tsinghua University, China
Guangyi Chen Department of Automation, Tsinghua University, China
Dr. Jiwen Lu Department of Automation, Tsinghua University, China
13:30 - 13:35 Introduction
13:35 - 14:25 Invited Talk: Person Re-identification
Weishi Zheng
14:25 - 14:55 Oral Session 1: Human Identification
Session chair: Liangliang Ren
14:25 - 14:40
SIMILARITY PRESERVED CAMERA-TO-CAMERA GAN FOR PERSON RE-IDENTIFICATION
Jianlei Liu1, Yun Zhou2, Lingchuan Sun1, Zhuqing Jiang1
1Beijing University of Posts and Telecommunications, China, 2Academy of Broadcasting Science, China
14:40 - 14:55
UNSUPERVISED DOMAIN ADAPTATION FOR DISGUISED FACE RECOGNITION
Fangyu Wu1,2, Shiyang Yan3, Jeremy S. Smith2, Wenjin Lu1, Bailing Zhang4
1Xi’an Jiaotong-Liverpool University, China, 2University of Liverpool, Liverpool, 3Queen’s University Belfast, UK, 4Zhejiang University, China
15:00 - 15:30 Coffee Break
15:30 - 16:45 Oral Session 2: Detection and Tracking
Session chair: Guangyi Chen
15:30 - 15:45
DUAL-CYCLE DEEP REINFORCEMENT LEARNING FOR STABILIZING FACE TRACKING
Congcong Zhu, Zhenhua Yu, Suping Wu, Hao Liu
Ningxia University, China
15:45 - 16:00
MULTI-TASK LEARNING FOR PEDESTRIAN BODY PARTS DETECTION AND MULTI-ATTRIBUTE CLASSIFICATION
Miaomiao Lou1,2, Lin Chen1, Feng Guo2
1Chongqing Institute of Green and Intelligent Technology, Chinese Academy of Science, China 2Chengdu University of Information Technology, China
16:00 - 16:15
CONTEXT ATTENTION MODULE FOR HUMAN HAND DETECTION
Zhihuai Xie1, Shaojie Wang2, Wentian Zhao2, Zhenhua Guo1
1Department of Information Science and Technology, Graduate School at Shenzhen, Tsinghua University, China,
2Department of Computer Science, University of Rochester, USA
16:15 - 16:30
TOWARD ROBUST ONLINE ADAPTIVE VISUAL TRACKING VIA PYRAMIDAL FEATURES EXTRACTION
Shuai Bai1, Yuan Dong1, Ting-Bing Xu2, Hongliang Bai3
1Beijing University of Posts and Telecommunications, China, 2Institute of Automation of Chinese Academy of Sciences, China, 3Beijing FaceAll Co., China
16:30 - 16:45
IMPROVING HUMAN POSE ESTIMATION WITH SELF-ATTENTION GENERATIVE ADVERSARIAL NETWORKS
Zhongzheng Cao, Rui Wang, Xiangyang Wang, Zhi Liu, Xiaoqiang Zhu
Shanghai University, China
16:45 - 17:30 Oral Session 3: Multimedia Processing
Session chair: Liangliang Ren
16:45 - 17:00
COLLABORATIVE REPRESENTATION GUIDED GRAPH LEARNING FOR VISUAL CLASSIFICATION
Sheng Huang, Yongxin Ge, Feiyu Chen, Kewen He, Xiaohong Zhang
Chongqing University, China
17:00 - 17:15
SPORTS HIGHLIGHTS GENERATION USING DECOMPOSED AUDIO INFORMATION
Muhammad Rafiqul Islam, Manoranjan Paul, Michael Antolovich, Ashad Kabir
Charles Sturt University, Australia
17:15 - 17:30
NEW BENCHMARK DATASETS AND A CHARACTER IDENTIFICATION SYSTEM ON TV SERIES
Zhuo Lei1, Qian Zhang2, Guoping Qiu3,4
1The University of Nottingham Ningbo China, 2University of Nottingham Ningbo China, 3Shenzhen University, China,
4University of Nottingham, UK