HFCAS OpenIR
A convolution-transformer dual branch network for head-pose and occlusion facial expression recognition
Liang, Xingcan1,2; Xu, Linsen5; Zhang, Wenxiang3; Zhang, Yan4; Liu, Jinfu1,2; Liu, Zhipeng1,2
2022-02-13
发表期刊VISUAL COMPUTER
ISSN0178-2789
通讯作者Xu, Linsen(lsxu@hhu.edu.cn)
摘要Facial expression recognition (FER) has attracted much more attention due to its broad range of applications. Occlusions and head-pose variations are two major obstacles for automatic FER. In this paper, we propose a convolution-transformer dual branch network (CT-DBN) that takes advantage of local and global facial information to tackle the real-word occlusions and head-pose variant robust FER. The CT-DBN contains two branches. Taking into account local modeling ability of CNN, the first branch utilizes CNN to capture local edge information. Inspired by transformers' successful application in natural language processing, we employ transformer to the second branch to be responsible for obtaining better global representation. Then, a local-global feature fusion module is proposed to adaptively integrate both features to hybrid features and model the relationship between them. With the help of feature fusion module, our network not only integrates local and global features in an adaptive weighting manner but can also learn the corresponding distinguishable features autonomously. Experimental results under inner-database and cross-database evaluation on four leading facial expression databases illustrate that our proposed CT-DBN outperforms other state-of-the-art methods and achieves robust performance under in-the-wild condition.
关键词Facial expression recognition CNNs Transformers Feature fusion Robust on occlusions and head-pose variations
DOI10.1007/s00371-022-02413-5
收录类别SCI
语种英语
资助项目National Key R&D Program of China[2017YFB1303200] ; Jiangsu Special Project for Frontier Leading Base Technology[BK20192004] ; Key Support Project of Dean Fund of Hefei Institutes of Physical Science, CAS[YZJJZX202017] ; Strategic High-tech Innovation Fund of Chinese Academy of Sciences[GQRC-19-15]
项目资助者National Key R&D Program of China ; Jiangsu Special Project for Frontier Leading Base Technology ; Key Support Project of Dean Fund of Hefei Institutes of Physical Science, CAS ; Strategic High-tech Innovation Fund of Chinese Academy of Sciences
WOS研究方向Computer Science
WOS类目Computer Science, Software Engineering
WOS记录号WOS:000754455000001
出版者SPRINGER
引用统计
被引频次:17[WOS]   [WOS记录]     [WOS相关记录]
文献类型期刊论文
条目标识符http://ir.hfcas.ac.cn:8080/handle/334002/127809
专题中国科学院合肥物质科学研究院
通讯作者Xu, Linsen
作者单位1.Chinese Acad Sci, Inst Intelligent Machines, Hefei Inst Phys Sci, Hefei 230031, Peoples R China
2.Univ Sci & Technol China, Hefei 230026, Peoples R China
3.Changzhou Univ, Sch Microelect & Control Engn, Changzhou 213164, Peoples R China
4.Anhui Jianzhu Univ, Sch Elect & Informat Engn, Hefei 230009, Peoples R China
5.Hohai Univ, Coll Mech & Elect Engn, Changzhou 213022, Peoples R China
推荐引用方式
GB/T 7714
Liang, Xingcan,Xu, Linsen,Zhang, Wenxiang,et al. A convolution-transformer dual branch network for head-pose and occlusion facial expression recognition[J]. VISUAL COMPUTER,2022.
APA Liang, Xingcan,Xu, Linsen,Zhang, Wenxiang,Zhang, Yan,Liu, Jinfu,&Liu, Zhipeng.(2022).A convolution-transformer dual branch network for head-pose and occlusion facial expression recognition.VISUAL COMPUTER.
MLA Liang, Xingcan,et al."A convolution-transformer dual branch network for head-pose and occlusion facial expression recognition".VISUAL COMPUTER (2022).
条目包含的文件
条目无相关文件。
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Liang, Xingcan]的文章
[Xu, Linsen]的文章
[Zhang, Wenxiang]的文章
百度学术
百度学术中相似的文章
[Liang, Xingcan]的文章
[Xu, Linsen]的文章
[Zhang, Wenxiang]的文章
必应学术
必应学术中相似的文章
[Liang, Xingcan]的文章
[Xu, Linsen]的文章
[Zhang, Wenxiang]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。