Multi-Frame Feature Aggregation for Real-Time Instrument Segmentation in Endoscopic Video
Lin, Shan (1); Qin, Fangbo (2); Peng, Haonan (1); Bly, Randall A. (3); Moe, Kris S. (3); Hannaford, Blake (1)
Journal: IEEE ROBOTICS AND AUTOMATION LETTERS
2021-10-01
Volume: 6, Issue: 4, Pages: 6773-6780
Keywords: Computer vision for medical robotics; deep learning for visual perception; object detection, segmentation and categorization
ISSN: 2377-3766
DOI: 10.1109/LRA.2021.3096156
Corresponding author: Lin, Shan (shanlin0331@gmail.com)
Abstract: Deep learning-based methods have achieved promising results on surgical instrument segmentation. However, the high computation cost may limit the application of deep models to time-sensitive tasks such as online surgical video analysis for robotic-assisted surgery. Moreover, current methods may still suffer from challenging conditions in surgical images such as various lighting conditions and the presence of blood. We propose a novel Multi-frame Feature Aggregation (MFFA) module to aggregate video frame features temporally and spatially in a recurrent mode. By distributing the computation load of deep feature extraction over sequential frames, we can use a lightweight encoder to reduce the computation costs at each time step. Moreover, public surgical videos usually are not labeled frame by frame, so we develop a method that can randomly synthesize a surgical frame sequence from a single labeled frame to assist network training. We demonstrate that our approach achieves superior performance to corresponding deeper segmentation models on two public surgery datasets.
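Illustrative note: the abstract mentions synthesizing a frame sequence from a single labeled frame but does not say how. The sketch below shows one possible reading of that idea, assuming the only motion is a small random translation accumulated over time and applied identically to the image and its mask. The function names, parameters, and the choice of translation-only motion are hypothetical and are not taken from the paper.

```python
import numpy as np


def shift(arr, offset):
    """Translate a (H, W) or (H, W, C) array by integer (dy, dx), zero-filling borders."""
    dy, dx = int(offset[0]), int(offset[1])
    h, w = arr.shape[:2]
    out = np.zeros_like(arr)
    # Source and destination windows that still overlap after the shift.
    ys, yd = max(0, -dy), max(0, dy)
    xs, xd = max(0, -dx), max(0, dx)
    hh, ww = h - abs(dy), w - abs(dx)
    if hh > 0 and ww > 0:
        out[yd:yd + hh, xd:xd + ww] = arr[ys:ys + hh, xs:xs + ww]
    return out


def synthesize_sequence(frame, mask, length=5, max_shift=4, seed=None):
    """Build a pseudo video clip from one labeled frame (assumed scheme).

    A running offset is perturbed by a small random step at every time
    index, and the same offset is applied to the frame and to its mask
    so that the labels stay aligned with the pixels.
    """
    rng = np.random.default_rng(seed)
    offset = np.zeros(2, dtype=np.int64)  # accumulated (dy, dx)
    frames, masks = [], []
    for _ in range(length):
        frames.append(shift(frame, offset))
        masks.append(shift(mask, offset))
        offset = offset + rng.integers(-max_shift, max_shift + 1, size=2)
    return np.stack(frames), np.stack(masks)
```

The first synthesized frame is the original frame (zero offset), and each later frame drifts from the previous one, which gives a recurrent model temporally coherent inputs with a label at every step. A realistic pipeline would likely also vary rotation, scale, and lighting; those are omitted here.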
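Funding project: National Science Foundation [IIS-2036255]
WOS research area: Robotics
Language: English
Publisher: IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
WOS accession number: WOS:000678343900013
Funding agency: National Science Foundation
Content type: Journal article
Source URL: [http://ir.ia.ac.cn/handle/173211/45640]
Collection: Research Center of Precision Sensing and Control_Precision Sensing and Control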
Author affiliations:
1. Univ Washington, Dept Elect & Comp Engn, Seattle, WA 98195 USA
2. Chinese Acad Sci, Res Ctr Precis Sensing & Control, Inst Automat, Beijing 100190, Peoples R China
3. UW, Dept Otolaryngol Head & Neck Surg, Seattle, WA 98105 USA
Recommended citation
GB/T 7714: Lin, Shan, Qin, Fangbo, Peng, Haonan, et al. Multi-Frame Feature Aggregation for Real-Time Instrument Segmentation in Endoscopic Video[J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6(4): 6773-6780.
APA: Lin, Shan, Qin, Fangbo, Peng, Haonan, Bly, Randall A., Moe, Kris S., & Hannaford, Blake. (2021). Multi-Frame Feature Aggregation for Real-Time Instrument Segmentation in Endoscopic Video. IEEE ROBOTICS AND AUTOMATION LETTERS, 6(4), 6773-6780.
MLA: Lin, Shan, et al. "Multi-Frame Feature Aggregation for Real-Time Instrument Segmentation in Endoscopic Video". IEEE ROBOTICS AND AUTOMATION LETTERS 6.4 (2021): 6773-6780.