Dima Damen

Professor of Computer Vision, School of Computer Science,
Lead of Machine Learning and Computer Vision Group, University of Bristol
EPSRC Early Career Fellow (2020-2025)

Senior Research Scientist, Google DeepMind

Publications

Can also be found on Google Scholar

Arxiv

(2024) T Soucek, P Gatti, M Wray, I Laptev, D Damen, J Sivic. ShowHowTo: Generating Scene-Conditioned Step-by-Step Visual Instructions. ArXiv | Website | Code | Dataset [Coming Soon]

(2024) J Huh*, J Chalk*, E Kazakos, D Damen, A Zisserman. EPIC-SOUDNS: A Large-Scale Dataset of Actions that Sound. (Extended Journal Version Under Review) ArXiv

(2024) S Bansal, M Wray, D Damen. HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision. ArXiv | Website | HOI-QA Dataset | Models and Code

(2024) Z Zhu and D Damen. Get a Grip: Reconstructing Hand-Object Stable Grasps in Egocentric Videos. ArXiv | Website and Videos | EPIC-Grasps Dataset and Code

(2021) J Munro, M Wray, D Larlus, G Csurka, D Damen. Domain Adaptation in Multi-View Embedding for Cross-Modal Video Retrieval. ArXiv

Peer-Reviewed Papers

2025

(2025) C Plizzari, S Goel, T Perrett, J Chalk, A Kanazawa, D Damen. Spatial Cognition from Egocentric Video: Out of Sight, Not Out of Mind. 3DV ArXiv | Website | Video

(2025) A Darkhalil, R Guerrier, A W Harley, D Damen EgoPoints: Advancing Point Tracking for Egocentric Videos. IEEE/CVF Winter Conference on Applications of Computer Vision (WACV). ArXiv | Webpage | Code and Benchmark

(2025) K Flanagan, D Damen, M Wray. Moment of Untruth: Dealing with Negative Queries in Video Moment Retrieval. IEEE/CVF Winter Conference on Applications of Computer Vision (WACV). [Coming Soon]

2024

(2024) T Perrett, T Han, D Damen, A Zisserman. It's Just Another Day: Unique Video Captioning by Discriminative Prompting. Asian Conference on Computer Vision (ACCV). Oral PDF ArXiv Preprint | Project Webpage | Code and Benchmark

(2024) S Sinha, A Stergiou, D Damen. Every Shot Counts: Using Exemplars for Repetition Counting in Videos. Asian Conference on Computer Vision (ACCV). ArXiv Preprint | Project Webpage | Code | Video

(2024) K Grauman et al. Ego4D: Around the World in 3,000 Hours of Egocentric Video. IEEE Transactions of Pattern Analysis and Machine Intelligence. Early Access PDF

(2024) G Goletto, T Nagarajan, G Averta, D Damen. AMEGO: Active Memory from long EGOcentric videos. European Conference on Computer Vision (ECCV). ArXiv Preprint | Webpage | Benchmark | Code

(2024) B Zhu, K Flanagan, A Fragomeni, M Wray, D Damen. Video Editing for Video Retrieval. Europen conference on Computer Vision Workshops (ECCVW) ArXiv

(2024) C Plizzari*, G Goletto*, A Furnari*, S Bansal*, F Ragusa*, GM Farinella, D Damen, T Tommasi. An Outlook into the Future of Egocentric Vision. International Journal of Computer Vision (IJCV). Published Vol 138 IJCV | Accepted - Online May 2024: IJCV PDF | OpenReview | ArXiv

(2024) J Chalk, J Huh, E Kazakos, A Zisserman, D Damen. TIM: A Time Interval Machine for Audio-Visual Video Understand. IEEE/CVF Computer Vision and Pattern Recognition (CVPR). Webpage | Code and Models | ArXiv | < a href="https://openaccess.thecvf.com/content/CVPR2024/papers/Chalk_TIM_A_Time_Interval_Machine_for_Audio-Visual_Action_Recognition_CVPR_2024_paper.pdf">CVF PDF | CVF Page

(2024) T Soucek, D Damen, M Wray, I Laptev, J Sivic. GenHowTo: Learning to Generate Actions and State Transformations from Instructional Videos. IEEE/CVF Computer Vision and Pattern Recognition (CVPR). ArXiv | Website | Code | CVF Open Access | CVF PDF

(2024) K Grauman et al. Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives. IEEE/CVF Computer Vision and Pattern Recognition (CVPR). ArXiv | Website and Dataset | CVF Open Access | CVF PDF

(2024) J Carreira, M King, V Patraucean, D Gokay, C Ionescu, Y Yang, D Zoran, J Heyward, C Doersch, Y Aytar, D Damen, A Zisserman. Learning from One Continuous Video Stream. IEEE/CVF Computer Vision and Pattern Recognition (CVPR). ArXiv | CVF PDF | CVF Open Access

(2024) D Yang, D Tjia, J H Berg, D Damen, P Agrawal, A Gupta. Rank2Reward: Learning Shaped Reward Functions from Passive Video. IEEE International Conference on Robotics and Automation (ICRA). ArXiv | Project Webpage and Videos

2023

(2023) V Tschernezki*, A Darkhalil*, Z Zhu*, D Fouhey, I Laina, D Larlus, D Damen, A Vedaldi. EPIC Fields: Marrying 3D Geometry and Video Understanding. Neural Information Processing Systems (NeurIPS). Preprint | Project Webpage | Dataset | Code | Video

(2023) V Patraucean, L Smaira, A Gupta, A Recasens, Y Yang, M Malinowski, C Doersch, L Markeeva, Y Sulsky, D Banarse, S Koppula, T Matejovicova, A Miech, A Frechette, J Zhang, H Klimczak, S Winkler, Y Aytar, R Koster, S Osindero, D Damen, A Zisserman, J Carreira. Perception Test: A Diagnostic Benchmark for Multimodal Models. Neural Information Processing Systems (NeurIPS). Preprint | Dataset and Code | Colab

(2023) K Flanagan, D Damen, M Wray. Learning Temporal Sentence Grounding From Narrated EgoVideos. British Machine Vision Conference (BMVC). ArXiv Camera Ready | Project Webpage | Code and Models

(2023) C Plizzari, T Perrett, B Caputo, D Damen. What can a cook in Italy teach a mechanic in India? Action Recognition Generalisation Over Scenarios and Locations. IEEE/CVF International Conference on Computer Vision (ICCV). CVF PDF | Preprint | Project Webpage | Dataset | Code | Video

(2023) T Perrett, S Sinha, T Perrett, M Mirmehdi, D Damen. Use Your Head: Improving Long-Tail Video Recognition. IEEE/CVF Computer Vision and Pattern Recognition (CVPR). CVF PDF | CVF Supp | ArXiv | Benchmark, Code and Models | Project Webpage

(2023) A Stergiou, D Damen. The Wisdom of Crowds: Temporal Progressive Attention for Early Action Prediction. IEEE/CVF Computer Vision and Pattern Recognition (CVPR). CVF PDF | CVF Supp | ArXiv | Project Webpage | code [Preliminary]

(2023) J Huh*, J Chalk*, E Kazakos, D Damen, A Zisserman. EPIC-SOUDNS: A Large-Scale Dataset of Actions that Sound. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). ArXiv Preprint | Webpage | Dataset, Code and Baseline Models | Audio Recognition Challenge

(2023) A Stergiou, D Damen. Play It Back: Iterative Attention for Audio Recognition. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). ArXiv Preprint | Code | Website

(2023) H Wang, M Mirmehdi, D Damen, T Perrett. Centre Stage: Centricity-based Audio-Visual Temporal Action Detection. BMVC Workshops. ArXiv

2022

(2022) A Darkhalil, D Shan, B Zhu, J Ma, A Kar, R Higgins, S Fidler, D Fouhey, D Damen. EPIC-KITCHENS VISOR Benchmark: VIdeo Segmentations and Object Relations. Neural Information Processing Systems (NeurIPS) Datasets and Benchmarks Track. ArXiv | Paper and Reviews | Project Webpage | Download | Trailer

(2022). A Fragomeni, M Wray, D Damen. ConTra: (Con)text (Tra)nsformer for Cross-Modal Video Retrieval. Asian Conference for Computer Vision (ACCV). Accepted as Oral. ArXiv | PDF Preprint | Project Webpage | Code | Video

(2022) K Q Lin, A J Wang, M Soldan, M Wray, R Yan, E Z Xu, D Gao, R Tu, W Zhao, W Kong, C Cai, H Wang, D Damen, B Ghanem, W Liu, M Z Shou. Egocentric Video-Language Pretraining. Neural Information Processing Systems (NeurIPS). ArXiv Preprint | Code

(2022) W Price, C Vondrick, D Damen. UnweaveNet: Unweaving Activity Stories. IEEE/CVF Computer Vision and Pattern Recognition (CVPR). PDF | ArXiv Preprint | Annotations | Project Webpage | Video

(2022) K Grauman et al. Around the World in 3,000 Hours of Egocentric Video. IEEE/CVF Computer Vision and Pattern Recognition (CVPR). ArXiv Preprint | Ego4D Project and Dataset

(2022) J Ma, D Damen. Hand-Object Interaction Reasoning. IEEE Conf. on Advanced Video and Signal-Based Surveillance (AVSS). Preprint

(2022) H. Wang, D Damen, M Mirmehdi, T Perrett. Refining Action Boundaries for One-stage Detection. IEEE Conf. on Advanced Video and Signal-Based Surveillance (AVSS).

(2022) T Perrett, A Masullo, D Damen, T Burghardt, I Craddock, M Mirmehdi. Personalized Energy Expenditure Estimation: Visual Sensing Approach With Deep Learning. JMIR Formative Research, vol 5 (9), . PDF

(2022) V Popescu, D Damen, T Perrett. An Evaluation of OCR on Egocentric Data. CVPR Workshops. Abstract

(2022) D Bazazian, A Calway, D Damen. Dual-Domain Image Synthesis using Segmentation-Guided GAN. IEEE/CVF Computer Vision and Pattern Recognition Workshops (CVPRW). PDF | ArXiv | Code

(2022) H Wang, D Damen, M Mirmehdi, T Perrett. TVNet: Temporal Voting Network for Action Localization. International Conference on Computer Vision Theory and Applications (VISAPP). ArXiv | Code

2021

(2021) E Kazakos, J Huh, A Nagrani, A Zisserman, D Damen. With a Little Help from my Temporal Context: Multimodal Egocentric Action Recognition. British Machine Vision Conference (BMVC). ArXiv | Project and Code | Video | Code, features and models

(2021) D Damen, H Doughty, GM Farinella, A Furnari, J Ma, E Kazakos, D Moltisanti, J Munro, T Perrett, W Price, M Wray. Rescaling Egocentric Vision: Collection Pipeline and Challenges for EPIC-KITCHENS-100. International Journal of Computer Vision (IJCV). (Early Access: HTML and PDF) | ArXiv (Sep 2021, v1 June 2020) | Project and Dataset

(2021) D Damen, H Doughty, GM Farinella, S Fidler, A Furnari, E Kazakos, D Moltisanti, J Munro, T Perrett, W Price, M Wray. The EPIC-KITCHENS Dataset: Collection, Challenges and Baselines. IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), vol. 43, no. 11, pp. 4125-4141. IEEE | Arxiv Preprint

(2021) M Wray, H Doughty and D Damen. On Semantic Similarity in Video Retrieval. IEEE/CVF Computer Vision and Pattern Recognition (CVPR). CVF PDF | ArXiv Camera Ready | Project Details | Video

(2021) T Perrett, T Burghardt, A Masullo, M Mirmehdi and D Damen. Temporal-Relational CrossTransformers for Few-Shot Action Recognition. IEEE/CVF Computer Vision and Pattern Recognition (CVPR). CVF PDF | ArXiv Camera Ready | Project Details | Code

(2021) E Kazakos, A Nagrani, A Zisserman, D Damen. Slow-Fast Auditory Streams for Audio Recognition. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). (Accepted) ArXiv Preprint, IEEE PDF, Code and Models, Project Webpage [Outstanding Paper]

(2021) B Sullivan, C Ludwig, D Damen, W Mayol-Cuevas, I Gilchrist. Look-Ahead Fixations During Visuomotor Behavior: Evidence from Assembling a Camping Tent. Journal of Vision 21(3):13. PDF

(2021) L Chen, Y Nakamura, K Kondo, D Damen, W Mayol-Cuevas. Integration of Experts’ and Beginners’ Machine Operation Experiences to Obtain a Detailed Task Model. IEICE TRANSACTIONS on Information and Systems. Vol.E104-D(1) Jan 2021, pp 152-161. PDF, Preprint

(2021) A Masullo, T Perrett, D Damen, T Burghardt, M Mirmehdi. No Need for a Lab: Towards Multi-Sensory Fusion for Ambient Assisted Living in Real-World Living Homes. International Conference on Computer Vision Theory and Applications (VISAPP).

2020

(2020) W Price, D Damen. Play Fair: Frame Attribution in Video Models. Asian Conference on Computer Vision (ACCV). ArXiv Preprint | CVF PDF | Project Details | Interactive Dashboard | Teaser Video | Code

(2020) T Perrett, A Masullo, T Burghardt, M Mirmehdi, D Damen. Meta-Learning with Context-Agnostic Initialisations. Asian Conference on Computer Vision (ACCV) ArXiv Preprint | CVF PDF | Project Details | Talk Video

(2020) J Munro, D Damen. Multi-Modal Domain Adaptation for Fine-Grained Action Recognition. Computer Vision and Pattern Recognition (CVPR). Arxiv (Camera Ready) | CVF PDF | Project Details | Code | Oral Presentation Video | Results Video

(2020) H Doughty, W Mayol-Cuevas, I Laptev, D Damen. Action Modifiers: Learning from Adverbs in Instructional Videos. Computer Vision and Pattern Recognition (CVPR). Arxiv (Preprint) | CVF PDF | Project Details | Talk Video | Results Video

(2020) M Lagunes-Fortiz, D Damen, W Mayol. Centroids Triplet Network and Temporally-Consistent Embeddings for In-Situ Object Recognition. IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). Preprint PDF

(2020) L Chen, Y Nakamura, K Kondo, D Damen, W Mayol-Cuevas. Integration of Experts' and Beginners' Machine Operation Experiences to Obtain a Detailed Task Model. IEICE TRANSACTIONS on Information and Systems. Vol E104-D (1) - online Jan 2021.

(2020) A Masullo, T Burghardt, D Damen, T Perrett, M Mirmehdi. Person Re-ID by Fusion of Video Silhouettes and Wearable Signals for Home Monitoring Applications. Sensors 29(9) 2576. PDF

2019

(2019) M Wray, G Csurka, D Larlus, D Damen. Fine-Grained Action Retrieval through Multiple Parts-of-Speech Embeddings. International Conference on Computer Vision (ICCV). Arxiv prepring | CVF PDF | Video | Project Details

(2019) E Kazakos, A Nagrani, A Zisserman, D Damen. EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action Recognition. International Conference on Computer Vision (ICCV). Arxiv | CVF PDF | Results Video | Talk Video | Project Details

(2019) W Price, D Damen. Retro-Actions: Learning 'Close' by Time-Reversing 'Open' Videos. ICCV Workshop on Multi-Discipline Approach for Learning Concepts (MDALC). Arxiv Preprint | Project Details

(2019) F Heidarivincheh, M Mirmehdi, D Damen. Weakly-Supervised Completion Moment Detection using Temporal Attention. ICCV Workshop on Human Behaviour Understanding. Arxiv | CVF PDF

(2019) A Masullo, T Burghardt, D Damen, T Perrett, M Mirmehdi. Who Goes There? Exploiting Silhouettes and Wearable Signals for Subject Identification in Multi-Person Environments. ICCV Workshop on Computer Vision for Physiological Measurement.

(2019) M Wray, D Damen. Learning Visual Actions Using Multiple Verb-Only Labels. British Machine Vision Conference (BMVC). Arxiv Preprint | PDF | Video | Project Details

(2019) T Perrett, D Damen. DDLSTM: Dual-Domain LSTM for Cross-Dataset Action Recognition. Computer Vision and Pattern Recognition (CVPR). Prepring | Arxiv | Video | Project Details

(2019) D Moltisanti, S Fidler, D Damen. Action Recognition from Single Timestamp Supervision in Untrimmed Videos. Computer Vision and Pattern Recognition (CVPR).Project Details | PDF (preprint) | Arxiv | Code

(2019) H Doughty, W Mayol-Cuevas, D Damen. The Pros and Cons: Rank-aware Temporal Attention for Skill Determination in Long Videos. Computer Vision and Pattern Recognition (CVPR). arxiv | Project Details

(2019) M Lagunes-Fortiz, W Mayol-Cuevas, D Damen. Learning Discriminative Embeddings for Object Recognition on-the-fly. International Conference on Robotics and Automation (ICRA) PDF (preprint)

(2019) Y Jang, B Sullivan, C Ludwig, I.D. Gilchrist, D Damen, W Mayol-Cuevas. EPIC-Tent: An Egocentric Video Dataset for Camping Tent Assembly. International Conference on Computer Vision Workshop. PDF | Project Details | Dataset | Annotations | Video

(2019) L Chen, Y Nakamura, K Kondo, D Damen, W Mayol-Cuevas. Hotspots Integrating of Expert and Beginner Experiences of Machine OperatiFons through Egocentric Vision. Machine Vision and Applications (MVA).

(2019) A Elkholy, M Hussein, W Gomaa, D Damen, E Saba. Efficient and Robust Skeleton-Based Quality Assessment and Abnormality Detection in Human Action Performance. IEEE Journal of Biomedical and Health Informatics PDF (Early Access)

(2019) A Masullo,T Burghardt, T Perrett, D Damen, Majid Mirmehdi. Sit-to-Stand Analysis in the Wild using Silhouettes for Longitudinal Health Monitoring. International Conference on Image Analysis and Recognition (ICIAR). ArXiv Preprint

(2019) V Ponce-López, T Burghardt, Y Sun, S Hannuna, D Damen, M Mirmehdi. Deep Compact Person Re-Identification with Distractor Synthesis via Guided DC-GANs. International Conference on Image Analysis and Processing (ICIAP). PDF

(2019) B Sullivan, H Doughty, W Mayol-Cuevas, D Damen, C Ludwig, I Gilchrist. [ABSTRACT:] Detecting Uncertainty While Assembling a Camping Tent. European Conference on Visual Perception (ECVP).

2018

(2018) D Damen, H Doughty, GM Farinella, S Fidler, A Furnari, E Kazakos, D Moltisanti, J Munro, T Perrett, W Price, M Wray. Scaling Egocentric Vision: The EPIC-KITCHENS Dataset. European Conference on Computer Vision (ECCV). arxiv | CVF PDF | Dataset | Project Page

(2018) F Heidarivincheh, M Mirmehdi, D Damen. Action Completion: A Temporal Model for Moment Detection. British Machine Vision Conference (BMVC). Arxiv | Dataset | Project Page

(2018) A Masullo, T Burghardt, D Damen, S Hannuna, V Ponce-López, M Mirmehdi. CaloriNet: From silhouettes to calorie estimation in private environments. British Machine Vision Conference (BMVC). Arxiv

(2018) H Doughty, D Damen, W Mayol-Cuevas. Who's Better? Who's Best? Pairwise Deep Ranking for Skill Determination. Computer Vision and Pattern Recognition (CVPR). arxiv | Project Page | Dataset

(2018) Y Xu, D Damen. Human Routine Change Detection using Bayesian Modelling. International Conference on Pattern Recognition (ICPR) Preprint | Project Page

(2018) M. Lagunes-Fortiz, D Damen, W Mayol-Cuevas. Instance-level Object Recognition on Video Data using Deep Temporal Coherence. International Symposium on Visual Computing (ISVC).

(2018) V Ponce-López, T Burghardt, S Hannunna, D Damen, A Masullo, M Mirmehdi. Semantically selective augmentation for deep compact person re-identification. European Conference on Computer Vision Workshops (ECCVW). PDF CVF

(2018) V Soleimani, M Mirmehdi, D Damen, J Dodd. Markerless Active Trunk Shape Modelling for Motioin Tolerant Remote Respiratory Assessment. International Conference on Image Processing (ICIP).

(2018) V Soleimani, M Mirmehdi, D Damen, James Dodd, Massimo Camplani, Sion Hannuna, Charlie Sharp, Jason Viner. Depth-based Whole Body Photoplethysmography in Remote Pulmonary Function Testing. IEEE Transactions on Biomedical Engineering, vol 65(6), pp 1421 - 1431.

(2018) L Tao, T Burghardt, M Mirmehdi, D Damen, Ashley Cooper, Sion Hannuna, Massimo Camplani, Adeline Paiment, I Craddock. Energy Expenditure Estimation using Visual and Intertial Sensors. IET Computer Vision vol 12 (1) pp 36 - 47

(2018) F De Luca, D Damen, J Kurton, M Wray, RM Pokhrel, MJ Werner. Traffic data as proxy of business downtime after natural disasters: the case of Kathmandu. National Conference on Earthquake Engineering. PDF

2017

(2017) D Moltisanti, M Wray, W Mayol-Cuevas, D Damen. Trespassing the Boundaries: Labeling Temporal Bounds for Object Interactions in Egocentric Video. International Conference on Computer Vision (ICCV). pdf (camera ready) | arxiv | Project Page | video

(2017) T Perrett, D Damen. Recurrent Assistance: Cross-Dataset Training of LSTMs on Kitchen Tasks. Fifth Int. Workshop on Assistive Computer Vision and Robotics (ACVR). International Conference on Computer Vision Workshops (ICCVW). pdf (camera ready)

(2017) R Layne, S Hannuna, M Camplani, J Hall, T Hospedales, T Xiang, M Mirmehdi, D Damen. A Dataset for Persistent Multi-Target Multi-Camera Tracking in RGB-D. IEEE Computer Vision and Pattern Recognition Workshops (CVPRW) pdf

(2017) V Soleimani, M Mirmehdi, D Damen, S Hannuna, M Camplani. Remote, Depth-based Lung Function Assessment. IEEE Transactions on Biomedical Engineering, vol 64(8) pp 1943 - 1958 pdf

(2017) C Sharp, V Soleimani, S Hannuna, M Camplani, D Damen, J Viner, M Mirmehdi, and J Dodd. Toward Respiratory Assessment Using Depth Measurements from a Time-of-Flight Sensor. Fronteirs in Physiology 8:65. pdf

(2017) M Camplani, A Paiement, M Mirmehdi , D Damen, S Hannuna, T Burghardt, L Tao. Multiple human tracking in RGB-depth data: a survey. IET Computer Vision, vol 11 (4) pp 265-285 pdf | ArXiv

(2017) T Leelasawassuk, D Damen, W Mayol-Cuevas. Automated capture and delivery of assistive task guidance with an eyewear computer: The GlaciAR system. Augmented Human. pdf | ArXiv | video

(2017) Y Xu, D Bull, D Damen. Unsupervised Long-Term Routine Modelling using Dynamic Bayesian Networks. IEEE Int Conf on Digital Image Computing Technologies and Applications (DICTA). PDF

(2017) L Chen, K Kondo, Y Nakamura, D Damen, W Mayol-Cuevas. Hotspots Detection for Machine Operation in Egocentric Vision. Machine Vision Applications (MVA) pdf (TBA), video

(2017) S Audrey, U Leonard, D Damen, Shared Use Routes for People Who Walk or Cycle: Addressing the Challenges. International Conference for Transport and Health, vol 5, pp 57-58 (abstract)

(2017) A Elkholy, M Hussein, W Gomaa, Dima Damen, Emmanuel Saba. A general descriptor for detecting abnormal action performance from skeletal data. IEEE Engineering in Medicine and Biology Society (EMBC). pdf

2016

(2016) D Damen, T Leelasawassuk, W Mayol-Cuevas. You-Do, I-Learn: Egocentric Unsupervised Discovery of Objects and their Modes of Interaction Towards Video-Based Guidance. Computer Vision and Image Understanding (CVIU), vol 149 pp 98-112 August 2016. [pdf | arxiv preprint]

(2016) L Tao, A Paiment, D Damen, M Mirmehdi, S Hannuna, M Camplani, T Burghardt, I Craddock. A Comparative Study of Pose Representation and Dynamics Modelling for Online Motion Quality Assessment. Computer Vision and Image Understanding (CVIU), vol 148 pp 136-152 July 2016. [pdf | Preprint]

(2016) F Heidarivincheh, M Mirmehdi, D Damen. Beyond Action Recognition: Action Completion in RGB-D Data. British Machine Vision Conference (BMVC). pdf | abstract | video | project | dataset

(2016) V Soleimani, M Mirmehdi, D Damen, S Hannuna, M Camplani. 3D Data Acquisition and Registration using Two Opposing Kinects. 3D Vision (3DV). pdf | code

(2016) M Wray, D Moltisanti, W Mayol-Cuevas, D Damen. SEMBED: Semantic Embedding of Egocentric Action Videos. First International Workshop on Egocentric Percetion, Interaction and Computing (EPIC). European Conference on Computer Vision Workshops (ECCVW). pdf | supplementary | video | dataset | project

(2016) L Tao, T Burghardt, S Hannuna, M Camplani, A Paiement, D Damen, M Mirmehdi, I Craddock. Calorie Counter: RGB-Depth Visual Estimation of Energy Expenditure at Home. 13th Asian Conference on Computer Vision (ACCV 2016) Workshop on Assistive Vision. ArXiv | project

2015

(2015) G Bleser, D Damen, A Behera, G Hendeby, K Mura, M Miezal, A Gee, N Petersen, G Macaes, H Domingues, D Gorecky, L Almeida, W Mayol-Cuevas, A Calway, A Cohn, D Hogg, D Stricker. Cognitive Learning, Monitoring and Assistance of Industrial Workflows Using Egocentric Sensor Networks. PLOS ONE, 30 June 2015. pdf

(2015) M Camplani, S Hannuna, M Mirmehdi, D Damen, L Tao, T Burghardt, A Paiement. Real-time RGB-D Tracking with Depth Scaling Kernelised Correlation Filters and Occlusion Handling. British Machine Vision Conference (BMVC). pdf | abstract | code | project

(2015) T Leelasawassuk, D Damen, W Mayol-Cuevas. Estimating Visual Attention from a Head Mounted IMU. International Symposium on Wearable Computers (ISWC). pdf video

(2015) V Soleimani, M Mirmehdi, D Damen, S Hannuna, M Camplani, J Vinery, J Boddy. Remote Pulmonary Function Testing using a Depth Sensor. IEEE/CAS-EMB Biomedical Circuits and Systems Conference (BioCAS). pdf | video

(2015) Y Xu, D Bull, D Damen. Unsupervised Daily Routine Modelling from a Depth Sensor using Bottom-Up and Top-Down Hierarchies. Asian Conference on Pattern Recognition (ACPR). pdf

(2015) T Hodan, D Damen, W Mayol-Cuvas, J Matas. Efficient Texture-less Object Detection for Augmented Reality Guidance. Workshop on Visual Recognition and Retrieval for Mixed and Augmented Reality. IEEE Int. Symposium on Mixed and Augmented Reality (ISMAR) Workshop. pdf

(2015) L Tao, T Burghardt, S Hannuna, M Camplani, A Paiment, D Damen, M Mirmehdi, I Craddock. A Comparative Home Activity Monitoring Study using Visual and Inertial Sensors. IEEE Int. Conf. on e-Health Networking, Applications and Services (Healthcom)

(2015) P Woznowski, X Fafoutis, T Song, S Hannuna, M Camplani, L Tao, A Paiement, E Mellios, M Haghighi, N Zhu, G Hilton, D Damen, T Burghardt, M Mirmehdi, R Piechocki, D Kaleshi, I Craddock. A Multi-modal Sensor Infrastructure for Healthcare in a Residential Environment. IEEE ICC Workshop on ICT-enabled services and technologies for eHealth and Ambient Assisted Living

2014

(2014) D Damen, T Leelasawassuk, O Haines, A Calway, W Mayol-Cuevas. You-Do, I-Learn: Discovering Task Relevant Objects and their Modes of Interaction from Multi-User Egocentric Video. British Machine Vision Conference (BMVC), Nottingham, UK. pdf | abstract | video | dataset | project

(2014) A Paiment, L Tao, S Hannuna, M Camplani, D Damen, M Mirmehdi. Online quality assessment of human movement from skeleton data. British Machine Vision Conference (BMVC), Nottingham, UK. pdf | abstract | project and datasets

(2014) D Damen, O Haines, T Leelasawassuk, A Calway, W Mayol-Cuevas. Multi-user egocentric Online System for Unsupervised Assistance on Object Usage. Computer Vision - ECCV 2014 Workshops Proceedings - Part III, p. 481-492, Zurich, Switzerland. preprint

2012

(2012) D Damen, D Hogg. Detecting Carried Objects from Sequences of Walking Pedestrians. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) vol 34 (6) pp 1056-1067 pdf | project | video

(2012) D Damen, D Hogg. Explaining Activities as Consistent Groups of Events - A Bayesian Framework using Attribute Multiset Grammars. International Journal of Computer Vision (IJCV) vol 98 (1) pp 83-102. pdf | project

(2012) D Damen, P Bunnun, A Calway, W Mayol-Cuevas. Real-time Learning and Detection of 3D Texture-less Objects: A Scalable Approach. British Machine Vision Conference (BMVC) pdf | abstract | poster | code [*Best Poster Prize*]

(2012) D Damen, A Gee, W Mayol-Cuevas, A Calway. Egocentric Real-time Workspace Monitoring using an RGB-D Camera. IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) pdf

(2012) P Bunnun, D Damen, A Calway, W Mayol-Cuevas. Integrating 3D Object Detection, Modelling and Tracking on a Mobile Phone. International Symposium on Mixed and Augmented Reality (ISMAR)

2011

(2011) D Damen, A Gee, A Calway, W Mayol-Cuevas. Detecting and Localising Multiple 3D Objects: A Fast and Scalable Approach. IROS Workshop on Active Semantic Perception and Object Search in the Real World (ASP-AVS-11) pdf

2010

(2010) P Bunnun, D Damen, S Subramanian, W Mayol-Cuevas. Interactive Image-Based Model Building for Handheld Devices. ISMAR Workshop on Augmented Reality Super Models pdf

2009

(2009) D Damen, D Hogg. Attribute Multiset Grammars for Global Explanations of Activities. British Machine Vision Conference (BMVC). pdf | abstract

(2009) D Damen, D Hogg. Recognizing Linked Events: Searching the Space of Feasible Explanations. Computer Vision and Pattern Recognition (CVPR) pdf | poster

2008

(2008) D Damen, D Hogg. Detecting Carried Objects in Short Video Sequences. European Conference on Computer Vision (ECCV) Springer-Verlag. 3,154-167 pdf poster demo

2007

(2007) D Damen, D Hogg. Associating People Dropping off and Picking up Objects. British Machine Vision Conference (BMVC). pdf Oral Presentation

(2007) D Damen, D Hogg. Bicycle Theft Detection. International Crime Science Conference. (CS2) pdf Oral Presentation

(2007) D Damen, D Hogg. Bicycle Theft Detection - How to deal with visual uncertainties. Make Some Noise (Faculty of Engineering Postgraduate Research Symposium). Faculty of Engineering, University of Leeds

Technical Reports

(2021) D Damen, A Fragomeni, J Munro, T Perrett, D Whettam, M Wray, A Furnari, G M Farinella, D Moltisanti. EPIC-KITCHENS-100- 2021 Challenges Report. PDF

(2020) D Damen and M Wray. Supervision Levels Scale (SLS). ArXiv

(2020) D Damen, E Kazakos, W Price, J Ma, H Doughty, A Furnari, GM Farinella. EPIC-KITCHENS-55 - 2020 Challenges Report. PDF

(2019) W Price, D Damen. An Evaluation of Action Recognition Models on EPIC-Kitchens. Arxiv | Github | PDF

(2019) D Damen, W Price, E Kazakos, A Furnari, GM Farinella. EPIC-KITCHENS - 2019 Challenges Report. PDF

(2016) S Gunner, D Damen. Potential Computer Vision Technologies for Monitoring Shared Spaces (Bristol Case Study). Commissioned by the Cabot Institute, University of Bristol Technical Reports

(2013) G Bleser, L Almeida, A Behera, A Calway, A Cohn, D Damen, H Domingues, A Gee, D Gorecky, D Hogg, M Kraly, G Macaes, F Marin, W Mayol-Cuevas, M Miezal, K Mura, N Petersen, N Vignais, L Paulo Santos, G Spaas, D Stricker. Cognitive Workflow Capturing and Rendering with On- Body Sensor Networks (COGNITO). German Research Center for Artificial Intelligence, DFKI Research Reports (RR), Vol. 13-02.

Editorial Work

Editors: Nalpantidis, Lazaros and Detry, Renaud and Damen, Dima and Bleser, Gabriele and Cakmak, Maya and Suphi Erden, Mustafa. Cognitive Robotics Systems: Concepts and Applications. Journal of Intelligent & Robotic Systems (June 2015). DOI: 10.1007/s10846-015-0244-9

Editors: Burghardt, Tilo and Damen, Dima and Mayol-Cuevas, Walterio and Mirmehdi, Majid. Correspondence, Matching and Recognition. International Journal of Computer Vision - Special Issue (May 2015) DOI:10.1007/s11263-015-0827-8

Editors: Burghardt, Tilo and Damen, Dima and Mayol-Cuevas, Walterio and Mirmehdi, Majid. Proceedings of the British Machine Vision Conference 2013. British Machine Vision Association (Bristol, September 2013).

Book Chapters

(2016) Woznowski et al. SPHERE: A Sensor Platform for Healthcare in a Residential Environment. Designing, Developing, and Facilitating Smart Cities. pdf

Theses

(2009) Activity Analysis: Finding Explanations for Sets of Events. PhD Thesis. University of Leeds pdf (6MB)

(2003) Visual Signature for Large Scale Tracking. MSc Thesis. University of Leeds pdf