Dima Damen

Professor of Computer Vision, School of Computer Science,
Lead of Machine Learning and Computer Vision Group, University of Bristol

Senior Staff Research Scientist, Google DeepMind

Publications

Can also be found on Google Scholar

Arxiv

(2025) M Hatano*, S Sinha*, J Chalk, W Li, H Saito, D Damen. Prime and Reach: Synthesising Body Motion for Gaze-Primed Object Reach. ArXiv | Website | Dataset

(2025) Z Zhu, Y Huang, Y Sato, D Damen. The N-Body Problem: Parallel Execution from Single-Person Egocentric Video. ArXiv | Webpage.

(2025) G Comanici et al. Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities. ArXiv

(2025) M Hatano, Z Zhu, H Saito, D Damen. The Invisible EgoHand: 3D Hand Forecasting through EgoBody Pose Estimation. ArXiv | Webpage | Code

(2024) J Carreira, D Gokey, M King, C Zhang, I Rocco, A Mahendran, T Keck, J Heyward, S Koppula, E Pot, G Erdogan, Y Hasson, Y Yang, K Greff, G Le Moing, S van Steenkiste, D Zoran, D Hudson, P Velez, L Polania, L Friedman, C Duvarney, R Goroshin, K Allen, J Walker, R Kabra, E Aboussouan, J Sun, T Kipf, C Doersch, V Patraucean, D Damen, P Luc, M Sajjadi, A Zisserman. Scaling 4D Representations. ArXiv

(2024) S Bansal, M Wray, D Damen. HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision. ArXiv | Website | HOI-QA Dataset | Models and Code

(2024) Z Zhu and D Damen. Get a Grip: Reconstructing Hand-Object Stable Grasps in Egocentric Videos. ArXiv | Website and Videos | EPIC-Grasps Dataset and Code

(2021) J Munro, M Wray, D Larlus, G Csurka, D Damen. Domain Adaptation in Multi-View Embedding for Cross-Modal Video Retrieval. ArXiv

Peer-Reviewed Papers

2026

(2026) T Han*, S Ebrahimi*, D Gokay, L Y Ku, M Ovsjanikov, I Babukova, D Zoran, V Patraucean, J Carreira, A Zisserman, D Damen. Unique Lives, Shared World: Learning from Single-Life Videos. IEEE/CVF Computer Vision and Pattern Recognition (CVPR) ArXiv

(2026) Z Xue, K Grauman, D Damen, A Zisserman, T Han. Seeing Without Pixels: Perception from Camera Trajectories. IEEE/CVF Computer Vision and Pattern Recognition (CVPR) ArXiv | Project Webpage

(2026) D Pujol-Perich, A Calpes, D Damen, S Escalera, M Wray. Beyond Caption-Based Queries for Video Moment Retrieval. IEEE/CVF Computer Vision and Pattern Recognition (CVPR) | ArXiv (Camera Ready) | Webpage

(2026) Z Zhu, S Bansal, S Tripathi, D Damen. Reconstructing Objects along Hand Interaction Timelines in Egocentric Video. IEEE/CVF Computer Vision and Pattern Recognition Workshops (CVPRW). ArXiv | Website

(2026) K Parida, O Emara, H Doughty, D Damen. Segmenting Collision Sound Sources in Egocentric Videos. IEEE/CVF Computer Vision and Pattern Recognition Workshops (CVPRW). ArXiv | Project Webpage | Dataset

(2026) T Perrett, T Han, D Damen, A Zisserman. It’s Just Another Day: Unique Video Captioning by Discriminitive Prompting. International Journal of Computer Vision (IJCV) 134, 60. Open Access [Journal extension of ACCV 2024 Best Paper]

(2026) R Guerrier, A Harley, D Damen. PointSt3R: Point Tracking through 3D Grounded Correspondence. IEEE/CVF Winter Conference on Applications of Computer Vision (WACV). ArXiv | Webpage | Code and Models

2025

(2025) K Grauman et al. Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives. International Journal of Computer Vision (IJCV). Open Access | PDF | Project Webpage and Dataset

(2025) A Fragomeni, D Damen, M Wray. Leveraging Modality Tags for Enhanced Cross-Modal Video Retrieval. British Machine Vision Conference (BMVC). ArXiv

(2025) J Huh*, J Chalk*, E Kazakos, D Damen, A Zisserman. EPIC-SOUDNS: A Large-Scale Dataset of Actions that Sound. IEEE Transactions on Pattern Analysis and Machine Intelligence 47, pp. 9953-9965. Journal Version (DOI), ArXiv

(2025) T Perrett, A Darkhalil, S Sinha, O Emara, S Pollard, K Parida, K Liu, P Gatti, S Bansal, K Flanagan, J Chalk, Z Zhu, R Guerrier, F Abdelazim, B Zhu, D Moltisanti, M Wray, H Doughty, D Damen. HD-EPIC: A Highly-Detailed Egocentric Video Dataset. IEEE/CVF Computer Vision and Pattern Recognition (CVPR) ArXiv | Webpage | Dataset | Annotations | Explore Dataset | CVF

(2025) T Soucek, P Gatti, M Wray, I Laptev, D Damen, J Sivic. ShowHowTo: Generating Scene-Conditioned Step-by-Step Visual Instructions. IEEE/CVF Computer Vision and Pattern Recognition (CVPR) ArXiv | Website | Code and Dataset | CVF

(2025) K Roth, Z Akata, D Damen, I Balazevic, O J Henaff. Context-Aware Multimodal Pretraining. IEEE/CVF Computer Vision and Pattern Recognition (CVPR) ArXiv | CVF

(2025) T Han, D Gokay, J Heyward, C Zhang, D Zoran, V Patraucean, J Carreira, Dima Damen, A Zisserman. Learning from Streaming Video with Orthogonal Gradients. IEEE/CVF Computer Vision and Pattern Recognition (CVPR). ArXiv | CVF | Code

(2025) C Plizzari, S Goel, T Perrett, J Chalk, A Kanazawa, D Damen. Spatial Cognition from Egocentric Video: Out of Sight, Not Out of Mind. 3DV ArXiv | Website | Video

(2025) A Darkhalil, R Guerrier, A W Harley, D Damen EgoPoints: Advancing Point Tracking for Egocentric Videos. IEEE/CVF Winter Conference on Applications of Computer Vision (WACV). ArXiv | Webpage | Code and Benchmark

(2025) K Flanagan, D Damen, M Wray. Moment of Untruth: Dealing with Negative Queries in Video Moment Retrieval. IEEE/CVF Winter Conference on Applications of Computer Vision (WACV). ArXiv | Project Webpage | Dataset Splits and Code

2024

(2024) T Perrett, T Han, D Damen, A Zisserman. It's Just Another Day: Unique Video Captioning by Discriminative Prompting. Asian Conference on Computer Vision (ACCV). (Best Paper Award) PDF ArXiv Preprint | Project Webpage | Code and Benchmark

(2024) S Sinha, A Stergiou, D Damen. Every Shot Counts: Using Exemplars for Repetition Counting in Videos. Asian Conference on Computer Vision (ACCV). ArXiv Preprint | Project Webpage | Code | Video

(2024) K Grauman et al. Ego4D: Around the World in 3,000 Hours of Egocentric Video. IEEE Transactions of Pattern Analysis and Machine Intelligence. Early Access PDF

(2024) G Goletto, T Nagarajan, G Averta, D Damen. AMEGO: Active Memory from long EGOcentric videos. European Conference on Computer Vision (ECCV). ArXiv Preprint | Webpage | Benchmark | Code

(2024) B Zhu, K Flanagan, A Fragomeni, M Wray, D Damen. Video Editing for Video Retrieval. Europen conference on Computer Vision Workshops (ECCVW) ArXiv

(2024) C Plizzari*, G Goletto*, A Furnari*, S Bansal*, F Ragusa*, GM Farinella, D Damen, T Tommasi. An Outlook into the Future of Egocentric Vision. International Journal of Computer Vision (IJCV). Published Vol 138 IJCV | Accepted - Online May 2024: IJCV PDF | OpenReview | ArXiv

(2024) J Chalk, J Huh, E Kazakos, A Zisserman, D Damen. TIM: A Time Interval Machine for Audio-Visual Video Understand. IEEE/CVF Computer Vision and Pattern Recognition (CVPR). Webpage | Code and Models | ArXiv | < a href="https://openaccess.thecvf.com/content/CVPR2024/papers/Chalk_TIM_A_Time_Interval_Machine_for_Audio-Visual_Action_Recognition_CVPR_2024_paper.pdf">CVF PDF | CVF Page

(2024) T Soucek, D Damen, M Wray, I Laptev, J Sivic. GenHowTo: Learning to Generate Actions and State Transformations from Instructional Videos. IEEE/CVF Computer Vision and Pattern Recognition (CVPR). ArXiv | Website | Code | CVF Open Access | CVF PDF

(2024) K Grauman et al. Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives. IEEE/CVF Computer Vision and Pattern Recognition (CVPR). ArXiv | Website and Dataset | CVF Open Access | CVF PDF

(2024) J Carreira, M King, V Patraucean, D Gokay, C Ionescu, Y Yang, D Zoran, J Heyward, C Doersch, Y Aytar, D Damen, A Zisserman. Learning from One Continuous Video Stream. IEEE/CVF Computer Vision and Pattern Recognition (CVPR). ArXiv | CVF PDF | CVF Open Access

(2024) D Yang, D Tjia, J H Berg, D Damen, P Agrawal, A Gupta. Rank2Reward: Learning Shaped Reward Functions from Passive Video. IEEE International Conference on Robotics and Automation (ICRA). ArXiv | Project Webpage and Videos

2023

(2023) V Tschernezki*, A Darkhalil*, Z Zhu*, D Fouhey, I Laina, D Larlus, D Damen, A Vedaldi. EPIC Fields: Marrying 3D Geometry and Video Understanding. Neural Information Processing Systems (NeurIPS). Preprint | Project Webpage | Dataset | Code | Video

(2023) V Patraucean, L Smaira, A Gupta, A Recasens, Y Yang, M Malinowski, C Doersch, L Markeeva, Y Sulsky, D Banarse, S Koppula, T Matejovicova, A Miech, A Frechette, J Zhang, H Klimczak, S Winkler, Y Aytar, R Koster, S Osindero, D Damen, A Zisserman, J Carreira. Perception Test: A Diagnostic Benchmark for Multimodal Models. Neural Information Processing Systems (NeurIPS). Preprint | Dataset and Code | Colab

(2023) K Flanagan, D Damen, M Wray. Learning Temporal Sentence Grounding From Narrated EgoVideos. British Machine Vision Conference (BMVC). ArXiv Camera Ready | Project Webpage | Code and Models

(2023) C Plizzari, T Perrett, B Caputo, D Damen. What can a cook in Italy teach a mechanic in India? Action Recognition Generalisation Over Scenarios and Locations. IEEE/CVF International Conference on Computer Vision (ICCV). CVF PDF | Preprint | Project Webpage | Dataset | Code | Video

(2023) T Perrett, S Sinha, T Perrett, M Mirmehdi, D Damen. Use Your Head: Improving Long-Tail Video Recognition. IEEE/CVF Computer Vision and Pattern Recognition (CVPR). CVF PDF | CVF Supp | ArXiv | Benchmark, Code and Models | Project Webpage

(2023) A Stergiou, D Damen. The Wisdom of Crowds: Temporal Progressive Attention for Early Action Prediction. IEEE/CVF Computer Vision and Pattern Recognition (CVPR). CVF PDF | CVF Supp | ArXiv | Project Webpage | code [Preliminary]

(2023) J Huh*, J Chalk*, E Kazakos, D Damen, A Zisserman. EPIC-SOUDNS: A Large-Scale Dataset of Actions that Sound. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). ArXiv Preprint | Webpage | Dataset, Code and Baseline Models | Audio Recognition Challenge

(2023) A Stergiou, D Damen. Play It Back: Iterative Attention for Audio Recognition. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). ArXiv Preprint | Code | Website

(2023) H Wang, M Mirmehdi, D Damen, T Perrett. Centre Stage: Centricity-based Audio-Visual Temporal Action Detection. BMVC Workshops. ArXiv

2022

(2022) A Darkhalil, D Shan, B Zhu, J Ma, A Kar, R Higgins, S Fidler, D Fouhey, D Damen. EPIC-KITCHENS VISOR Benchmark: VIdeo Segmentations and Object Relations. Neural Information Processing Systems (NeurIPS) Datasets and Benchmarks Track. ArXiv | Paper and Reviews | Project Webpage | Download | Trailer

(2022). A Fragomeni, M Wray, D Damen. ConTra: (Con)text (Tra)nsformer for Cross-Modal Video Retrieval. Asian Conference for Computer Vision (ACCV). Accepted as Oral. ArXiv | PDF Preprint | Project Webpage | Code | Video

(2022) K Q Lin, A J Wang, M Soldan, M Wray, R Yan, E Z Xu, D Gao, R Tu, W Zhao, W Kong, C Cai, H Wang, D Damen, B Ghanem, W Liu, M Z Shou. Egocentric Video-Language Pretraining. Neural Information Processing Systems (NeurIPS). ArXiv Preprint | Code

(2022) W Price, C Vondrick, D Damen. UnweaveNet: Unweaving Activity Stories. IEEE/CVF Computer Vision and Pattern Recognition (CVPR). PDF | ArXiv Preprint | Annotations | Project Webpage | Video

(2022) K Grauman et al. Around the World in 3,000 Hours of Egocentric Video. IEEE/CVF Computer Vision and Pattern Recognition (CVPR). ArXiv Preprint | Ego4D Project and Dataset

(2022) J Ma, D Damen. Hand-Object Interaction Reasoning. IEEE Conf. on Advanced Video and Signal-Based Surveillance (AVSS). Preprint

(2022) H. Wang, D Damen, M Mirmehdi, T Perrett. Refining Action Boundaries for One-stage Detection. IEEE Conf. on Advanced Video and Signal-Based Surveillance (AVSS).

(2022) T Perrett, A Masullo, D Damen, T Burghardt, I Craddock, M Mirmehdi. Personalized Energy Expenditure Estimation: Visual Sensing Approach With Deep Learning. JMIR Formative Research, vol 5 (9), . PDF

(2022) V Popescu, D Damen, T Perrett. An Evaluation of OCR on Egocentric Data. CVPR Workshops. Abstract

(2022) D Bazazian, A Calway, D Damen. Dual-Domain Image Synthesis using Segmentation-Guided GAN. IEEE/CVF Computer Vision and Pattern Recognition Workshops (CVPRW). PDF | ArXiv | Code

(2022) H Wang, D Damen, M Mirmehdi, T Perrett. TVNet: Temporal Voting Network for Action Localization. International Conference on Computer Vision Theory and Applications (VISAPP). ArXiv | Code

2021

(2021) E Kazakos, J Huh, A Nagrani, A Zisserman, D Damen. With a Little Help from my Temporal Context: Multimodal Egocentric Action Recognition. British Machine Vision Conference (BMVC). ArXiv | Project and Code | Video | Code, features and models

(2021) D Damen, H Doughty, GM Farinella, A Furnari, J Ma, E Kazakos, D Moltisanti, J Munro, T Perrett, W Price, M Wray. Rescaling Egocentric Vision: Collection Pipeline and Challenges for EPIC-KITCHENS-100. International Journal of Computer Vision (IJCV). (Early Access: HTML and PDF) | ArXiv (Sep 2021, v1 June 2020) | Project and Dataset

(2021) D Damen, H Doughty, GM Farinella, S Fidler, A Furnari, E Kazakos, D Moltisanti, J Munro, T Perrett, W Price, M Wray. The EPIC-KITCHENS Dataset: Collection, Challenges and Baselines. IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), vol. 43, no. 11, pp. 4125-4141. IEEE | Arxiv Preprint

(2021) M Wray, H Doughty and D Damen. On Semantic Similarity in Video Retrieval. IEEE/CVF Computer Vision and Pattern Recognition (CVPR). CVF PDF | ArXiv Camera Ready | Project Details | Video

(2021) T Perrett, T Burghardt, A Masullo, M Mirmehdi and D Damen. Temporal-Relational CrossTransformers for Few-Shot Action Recognition. IEEE/CVF Computer Vision and Pattern Recognition (CVPR). CVF PDF | ArXiv Camera Ready | Project Details | Code

(2021) E Kazakos, A Nagrani, A Zisserman, D Damen. Slow-Fast Auditory Streams for Audio Recognition. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). (Accepted) ArXiv Preprint, IEEE PDF, Code and Models, Project Webpage [Outstanding Paper]

(2021) B Sullivan, C Ludwig, D Damen, W Mayol-Cuevas, I Gilchrist. Look-Ahead Fixations During Visuomotor Behavior: Evidence from Assembling a Camping Tent. Journal of Vision 21(3):13. PDF

(2021) L Chen, Y Nakamura, K Kondo, D Damen, W Mayol-Cuevas. Integration of Experts’ and Beginners’ Machine Operation Experiences to Obtain a Detailed Task Model. IEICE TRANSACTIONS on Information and Systems. Vol.E104-D(1) Jan 2021, pp 152-161. PDF, Preprint

(2021) A Masullo, T Perrett, D Damen, T Burghardt, M Mirmehdi. No Need for a Lab: Towards Multi-Sensory Fusion for Ambient Assisted Living in Real-World Living Homes. International Conference on Computer Vision Theory and Applications (VISAPP).

2020

(2020) W Price, D Damen. Play Fair: Frame Attribution in Video Models. Asian Conference on Computer Vision (ACCV). ArXiv Preprint | CVF PDF | Project Details | Interactive Dashboard | Teaser Video | Code

(2020) T Perrett, A Masullo, T Burghardt, M Mirmehdi, D Damen. Meta-Learning with Context-Agnostic Initialisations. Asian Conference on Computer Vision (ACCV) ArXiv Preprint | CVF PDF | Project Details | Talk Video

(2020) J Munro, D Damen. Multi-Modal Domain Adaptation for Fine-Grained Action Recognition. Computer Vision and Pattern Recognition (CVPR). Arxiv (Camera Ready) | CVF PDF | Project Details | Code | Oral Presentation Video | Results Video

(2020) H Doughty, W Mayol-Cuevas, I Laptev, D Damen. Action Modifiers: Learning from Adverbs in Instructional Videos. Computer Vision and Pattern Recognition (CVPR). Arxiv (Preprint) | CVF PDF | Project Details | Talk Video | Results Video

(2020) M Lagunes-Fortiz, D Damen, W Mayol. Centroids Triplet Network and Temporally-Consistent Embeddings for In-Situ Object Recognition. IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). Preprint PDF

(2020) L Chen, Y Nakamura, K Kondo, D Damen, W Mayol-Cuevas. Integration of Experts' and Beginners' Machine Operation Experiences to Obtain a Detailed Task Model. IEICE TRANSACTIONS on Information and Systems. Vol E104-D (1) - online Jan 2021.

(2020) A Masullo, T Burghardt, D Damen, T Perrett, M Mirmehdi. Person Re-ID by Fusion of Video Silhouettes and Wearable Signals for Home Monitoring Applications. Sensors 29(9) 2576. PDF

2019

(2019) M Wray, G Csurka, D Larlus, D Damen. Fine-Grained Action Retrieval through Multiple Parts-of-Speech Embeddings. International Conference on Computer Vision (ICCV). Arxiv prepring | CVF PDF | Video | Project Details

(2019) E Kazakos, A Nagrani, A Zisserman, D Damen. EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action Recognition. International Conference on Computer Vision (ICCV). Arxiv | CVF PDF | Results Video | Talk Video | Project Details

(2019) W Price, D Damen. Retro-Actions: Learning 'Close' by Time-Reversing 'Open' Videos. ICCV Workshop on Multi-Discipline Approach for Learning Concepts (MDALC). Arxiv Preprint | Project Details

(2019) F Heidarivincheh, M Mirmehdi, D Damen. Weakly-Supervised Completion Moment Detection using Temporal Attention. ICCV Workshop on Human Behaviour Understanding. Arxiv | CVF PDF

(2019) A Masullo, T Burghardt, D Damen, T Perrett, M Mirmehdi. Who Goes There? Exploiting Silhouettes and Wearable Signals for Subject Identification in Multi-Person Environments. ICCV Workshop on Computer Vision for Physiological Measurement.

(2019) M Wray, D Damen. Learning Visual Actions Using Multiple Verb-Only Labels. British Machine Vision Conference (BMVC). Arxiv Preprint | PDF | Video | Project Details

(2019) T Perrett, D Damen. DDLSTM: Dual-Domain LSTM for Cross-Dataset Action Recognition. Computer Vision and Pattern Recognition (CVPR). Prepring | Arxiv | Video | Project Details

(2019) D Moltisanti, S Fidler, D Damen. Action Recognition from Single Timestamp Supervision in Untrimmed Videos. Computer Vision and Pattern Recognition (CVPR).Project Details | PDF (preprint) | Arxiv | Code

(2019) H Doughty, W Mayol-Cuevas, D Damen. The Pros and Cons: Rank-aware Temporal Attention for Skill Determination in Long Videos. Computer Vision and Pattern Recognition (CVPR). arxiv | Project Details

(2019) M Lagunes-Fortiz, W Mayol-Cuevas, D Damen. Learning Discriminative Embeddings for Object Recognition on-the-fly. International Conference on Robotics and Automation (ICRA) PDF (preprint)

(2019) Y Jang, B Sullivan, C Ludwig, I.D. Gilchrist, D Damen, W Mayol-Cuevas. EPIC-Tent: An Egocentric Video Dataset for Camping Tent Assembly. International Conference on Computer Vision Workshop. PDF | Project Details | Dataset | Annotations | Video

(2019) L Chen, Y Nakamura, K Kondo, D Damen, W Mayol-Cuevas. Hotspots Integrating of Expert and Beginner Experiences of Machine OperatiFons through Egocentric Vision. Machine Vision and Applications (MVA).

(2019) A Elkholy, M Hussein, W Gomaa, D Damen, E Saba. Efficient and Robust Skeleton-Based Quality Assessment and Abnormality Detection in Human Action Performance. IEEE Journal of Biomedical and Health Informatics PDF (Early Access)

(2019) A Masullo,T Burghardt, T Perrett, D Damen, Majid Mirmehdi. Sit-to-Stand Analysis in the Wild using Silhouettes for Longitudinal Health Monitoring. International Conference on Image Analysis and Recognition (ICIAR). ArXiv Preprint

(2019) V Ponce-López, T Burghardt, Y Sun, S Hannuna, D Damen, M Mirmehdi. Deep Compact Person Re-Identification with Distractor Synthesis via Guided DC-GANs. International Conference on Image Analysis and Processing (ICIAP). PDF

(2019) B Sullivan, H Doughty, W Mayol-Cuevas, D Damen, C Ludwig, I Gilchrist. [ABSTRACT:] Detecting Uncertainty While Assembling a Camping Tent. European Conference on Visual Perception (ECVP).

2018

(2018) D Damen, H Doughty, GM Farinella, S Fidler, A Furnari, E Kazakos, D Moltisanti, J Munro, T Perrett, W Price, M Wray. Scaling Egocentric Vision: The EPIC-KITCHENS Dataset. European Conference on Computer Vision (ECCV). arxiv | CVF PDF | Dataset | Project Page

(2018) F Heidarivincheh, M Mirmehdi, D Damen. Action Completion: A Temporal Model for Moment Detection. British Machine Vision Conference (BMVC). Arxiv | Dataset | Project Page

(2018) A Masullo, T Burghardt, D Damen, S Hannuna, V Ponce-López, M Mirmehdi. CaloriNet: From silhouettes to calorie estimation in private environments. British Machine Vision Conference (BMVC). Arxiv

(2018) H Doughty, D Damen, W Mayol-Cuevas. Who's Better? Who's Best? Pairwise Deep Ranking for Skill Determination. Computer Vision and Pattern Recognition (CVPR). arxiv | Project Page | Dataset

(2018) Y Xu, D Damen. Human Routine Change Detection using Bayesian Modelling. International Conference on Pattern Recognition (ICPR) Preprint | Project Page

(2018) M. Lagunes-Fortiz, D Damen, W Mayol-Cuevas. Instance-level Object Recognition on Video Data using Deep Temporal Coherence. International Symposium on Visual Computing (ISVC).

(2018) V Ponce-López, T Burghardt, S Hannunna, D Damen, A Masullo, M Mirmehdi. Semantically selective augmentation for deep compact person re-identification. European Conference on Computer Vision Workshops (ECCVW). PDF CVF

(2018) V Soleimani, M Mirmehdi, D Damen, J Dodd. Markerless Active Trunk Shape Modelling for Motioin Tolerant Remote Respiratory Assessment. International Conference on Image Processing (ICIP).

(2018) V Soleimani, M Mirmehdi, D Damen, James Dodd, Massimo Camplani, Sion Hannuna, Charlie Sharp, Jason Viner. Depth-based Whole Body Photoplethysmography in Remote Pulmonary Function Testing. IEEE Transactions on Biomedical Engineering, vol 65(6), pp 1421 - 1431.

(2018) L Tao, T Burghardt, M Mirmehdi, D Damen, Ashley Cooper, Sion Hannuna, Massimo Camplani, Adeline Paiment, I Craddock. Energy Expenditure Estimation using Visual and Intertial Sensors. IET Computer Vision vol 12 (1) pp 36 - 47

(2018) F De Luca, D Damen, J Kurton, M Wray, RM Pokhrel, MJ Werner. Traffic data as proxy of business downtime after natural disasters: the case of Kathmandu. National Conference on Earthquake Engineering. PDF

2017

(2017) D Moltisanti, M Wray, W Mayol-Cuevas, D Damen. Trespassing the Boundaries: Labeling Temporal Bounds for Object Interactions in Egocentric Video. International Conference on Computer Vision (ICCV). pdf (camera ready) | arxiv | Project Page | video

(2017) T Perrett, D Damen. Recurrent Assistance: Cross-Dataset Training of LSTMs on Kitchen Tasks. Fifth Int. Workshop on Assistive Computer Vision and Robotics (ACVR). International Conference on Computer Vision Workshops (ICCVW). pdf (camera ready)

(2017) R Layne, S Hannuna, M Camplani, J Hall, T Hospedales, T Xiang, M Mirmehdi, D Damen. A Dataset for Persistent Multi-Target Multi-Camera Tracking in RGB-D. IEEE Computer Vision and Pattern Recognition Workshops (CVPRW) pdf

(2017) V Soleimani, M Mirmehdi, D Damen, S Hannuna, M Camplani. Remote, Depth-based Lung Function Assessment. IEEE Transactions on Biomedical Engineering, vol 64(8) pp 1943 - 1958 pdf

(2017) C Sharp, V Soleimani, S Hannuna, M Camplani, D Damen, J Viner, M Mirmehdi, and J Dodd. Toward Respiratory Assessment Using Depth Measurements from a Time-of-Flight Sensor. Fronteirs in Physiology 8:65. pdf

(2017) M Camplani, A Paiement, M Mirmehdi , D Damen, S Hannuna, T Burghardt, L Tao. Multiple human tracking in RGB-depth data: a survey. IET Computer Vision, vol 11 (4) pp 265-285 pdf | ArXiv

(2017) T Leelasawassuk, D Damen, W Mayol-Cuevas. Automated capture and delivery of assistive task guidance with an eyewear computer: The GlaciAR system. Augmented Human. pdf | ArXiv | video

(2017) Y Xu, D Bull, D Damen. Unsupervised Long-Term Routine Modelling using Dynamic Bayesian Networks. IEEE Int Conf on Digital Image Computing Technologies and Applications (DICTA). PDF

(2017) L Chen, K Kondo, Y Nakamura, D Damen, W Mayol-Cuevas. Hotspots Detection for Machine Operation in Egocentric Vision. Machine Vision Applications (MVA) pdf (TBA), video

(2017) S Audrey, U Leonard, D Damen, Shared Use Routes for People Who Walk or Cycle: Addressing the Challenges. International Conference for Transport and Health, vol 5, pp 57-58 (abstract)

(2017) A Elkholy, M Hussein, W Gomaa, Dima Damen, Emmanuel Saba. A general descriptor for detecting abnormal action performance from skeletal data. IEEE Engineering in Medicine and Biology Society (EMBC). pdf

2016

(2016) D Damen, T Leelasawassuk, W Mayol-Cuevas. You-Do, I-Learn: Egocentric Unsupervised Discovery of Objects and their Modes of Interaction Towards Video-Based Guidance. Computer Vision and Image Understanding (CVIU), vol 149 pp 98-112 August 2016. [pdf | arxiv preprint]

(2016) L Tao, A Paiment, D Damen, M Mirmehdi, S Hannuna, M Camplani, T Burghardt, I Craddock. A Comparative Study of Pose Representation and Dynamics Modelling for Online Motion Quality Assessment. Computer Vision and Image Understanding (CVIU), vol 148 pp 136-152 July 2016. [pdf | Preprint]

(2016) F Heidarivincheh, M Mirmehdi, D Damen. Beyond Action Recognition: Action Completion in RGB-D Data. British Machine Vision Conference (BMVC). pdf | abstract | video | project | dataset

(2016) V Soleimani, M Mirmehdi, D Damen, S Hannuna, M Camplani. 3D Data Acquisition and Registration using Two Opposing Kinects. 3D Vision (3DV). pdf | code

(2016) M Wray, D Moltisanti, W Mayol-Cuevas, D Damen. SEMBED: Semantic Embedding of Egocentric Action Videos. First International Workshop on Egocentric Percetion, Interaction and Computing (EPIC). European Conference on Computer Vision Workshops (ECCVW). pdf | supplementary | video | dataset | project

(2016) L Tao, T Burghardt, S Hannuna, M Camplani, A Paiement, D Damen, M Mirmehdi, I Craddock. Calorie Counter: RGB-Depth Visual Estimation of Energy Expenditure at Home. 13th Asian Conference on Computer Vision (ACCV 2016) Workshop on Assistive Vision. ArXiv | project

2015

(2015) G Bleser, D Damen, A Behera, G Hendeby, K Mura, M Miezal, A Gee, N Petersen, G Macaes, H Domingues, D Gorecky, L Almeida, W Mayol-Cuevas, A Calway, A Cohn, D Hogg, D Stricker. Cognitive Learning, Monitoring and Assistance of Industrial Workflows Using Egocentric Sensor Networks. PLOS ONE, 30 June 2015. pdf

(2015) M Camplani, S Hannuna, M Mirmehdi, D Damen, L Tao, T Burghardt, A Paiement. Real-time RGB-D Tracking with Depth Scaling Kernelised Correlation Filters and Occlusion Handling. British Machine Vision Conference (BMVC). pdf | abstract | code | project

(2015) T Leelasawassuk, D Damen, W Mayol-Cuevas. Estimating Visual Attention from a Head Mounted IMU. International Symposium on Wearable Computers (ISWC). pdf video

(2015) V Soleimani, M Mirmehdi, D Damen, S Hannuna, M Camplani, J Vinery, J Boddy. Remote Pulmonary Function Testing using a Depth Sensor. IEEE/CAS-EMB Biomedical Circuits and Systems Conference (BioCAS). pdf | video

(2015) Y Xu, D Bull, D Damen. Unsupervised Daily Routine Modelling from a Depth Sensor using Bottom-Up and Top-Down Hierarchies. Asian Conference on Pattern Recognition (ACPR). pdf

(2015) T Hodan, D Damen, W Mayol-Cuvas, J Matas. Efficient Texture-less Object Detection for Augmented Reality Guidance. Workshop on Visual Recognition and Retrieval for Mixed and Augmented Reality. IEEE Int. Symposium on Mixed and Augmented Reality (ISMAR) Workshop. pdf

(2015) L Tao, T Burghardt, S Hannuna, M Camplani, A Paiment, D Damen, M Mirmehdi, I Craddock. A Comparative Home Activity Monitoring Study using Visual and Inertial Sensors. IEEE Int. Conf. on e-Health Networking, Applications and Services (Healthcom)

(2015) P Woznowski, X Fafoutis, T Song, S Hannuna, M Camplani, L Tao, A Paiement, E Mellios, M Haghighi, N Zhu, G Hilton, D Damen, T Burghardt, M Mirmehdi, R Piechocki, D Kaleshi, I Craddock. A Multi-modal Sensor Infrastructure for Healthcare in a Residential Environment. IEEE ICC Workshop on ICT-enabled services and technologies for eHealth and Ambient Assisted Living

2014

(2014) D Damen, T Leelasawassuk, O Haines, A Calway, W Mayol-Cuevas. You-Do, I-Learn: Discovering Task Relevant Objects and their Modes of Interaction from Multi-User Egocentric Video. British Machine Vision Conference (BMVC), Nottingham, UK. pdf | abstract | video | dataset | project

(2014) A Paiment, L Tao, S Hannuna, M Camplani, D Damen, M Mirmehdi. Online quality assessment of human movement from skeleton data. British Machine Vision Conference (BMVC), Nottingham, UK. pdf | abstract | project and datasets

(2014) D Damen, O Haines, T Leelasawassuk, A Calway, W Mayol-Cuevas. Multi-user egocentric Online System for Unsupervised Assistance on Object Usage. Computer Vision - ECCV 2014 Workshops Proceedings - Part III, p. 481-492, Zurich, Switzerland. preprint

2012

(2012) D Damen, D Hogg. Detecting Carried Objects from Sequences of Walking Pedestrians. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) vol 34 (6) pp 1056-1067 pdf | project | video

(2012) D Damen, D Hogg. Explaining Activities as Consistent Groups of Events - A Bayesian Framework using Attribute Multiset Grammars. International Journal of Computer Vision (IJCV) vol 98 (1) pp 83-102. pdf | project

(2012) D Damen, P Bunnun, A Calway, W Mayol-Cuevas. Real-time Learning and Detection of 3D Texture-less Objects: A Scalable Approach. British Machine Vision Conference (BMVC) pdf | abstract | poster | code [*Best Poster Prize*]

(2012) D Damen, A Gee, W Mayol-Cuevas, A Calway. Egocentric Real-time Workspace Monitoring using an RGB-D Camera. IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) pdf

(2012) P Bunnun, D Damen, A Calway, W Mayol-Cuevas. Integrating 3D Object Detection, Modelling and Tracking on a Mobile Phone. International Symposium on Mixed and Augmented Reality (ISMAR)

2011

(2011) D Damen, A Gee, A Calway, W Mayol-Cuevas. Detecting and Localising Multiple 3D Objects: A Fast and Scalable Approach. IROS Workshop on Active Semantic Perception and Object Search in the Real World (ASP-AVS-11) pdf

2010

(2010) P Bunnun, D Damen, S Subramanian, W Mayol-Cuevas. Interactive Image-Based Model Building for Handheld Devices. ISMAR Workshop on Augmented Reality Super Models pdf

2009

(2009) D Damen, D Hogg. Attribute Multiset Grammars for Global Explanations of Activities. British Machine Vision Conference (BMVC). pdf | abstract

(2009) D Damen, D Hogg. Recognizing Linked Events: Searching the Space of Feasible Explanations. Computer Vision and Pattern Recognition (CVPR) pdf | poster

2008

(2008) D Damen, D Hogg. Detecting Carried Objects in Short Video Sequences. European Conference on Computer Vision (ECCV) Springer-Verlag. 3,154-167 pdf poster demo

2007

(2007) D Damen, D Hogg. Associating People Dropping off and Picking up Objects. British Machine Vision Conference (BMVC). pdf Oral Presentation

(2007) D Damen, D Hogg. Bicycle Theft Detection. International Crime Science Conference. (CS2) pdf Oral Presentation

(2007) D Damen, D Hogg. Bicycle Theft Detection - How to deal with visual uncertainties. Make Some Noise (Faculty of Engineering Postgraduate Research Symposium). Faculty of Engineering, University of Leeds

Technical Reports

(2021) D Damen, A Fragomeni, J Munro, T Perrett, D Whettam, M Wray, A Furnari, G M Farinella, D Moltisanti. EPIC-KITCHENS-100- 2021 Challenges Report. PDF

(2020) D Damen and M Wray. Supervision Levels Scale (SLS). ArXiv

(2020) D Damen, E Kazakos, W Price, J Ma, H Doughty, A Furnari, GM Farinella. EPIC-KITCHENS-55 - 2020 Challenges Report. PDF

(2019) W Price, D Damen. An Evaluation of Action Recognition Models on EPIC-Kitchens. Arxiv | Github | PDF

(2019) D Damen, W Price, E Kazakos, A Furnari, GM Farinella. EPIC-KITCHENS - 2019 Challenges Report. PDF

(2016) S Gunner, D Damen. Potential Computer Vision Technologies for Monitoring Shared Spaces (Bristol Case Study). Commissioned by the Cabot Institute, University of Bristol Technical Reports

(2013) G Bleser, L Almeida, A Behera, A Calway, A Cohn, D Damen, H Domingues, A Gee, D Gorecky, D Hogg, M Kraly, G Macaes, F Marin, W Mayol-Cuevas, M Miezal, K Mura, N Petersen, N Vignais, L Paulo Santos, G Spaas, D Stricker. Cognitive Workflow Capturing and Rendering with On- Body Sensor Networks (COGNITO). German Research Center for Artificial Intelligence, DFKI Research Reports (RR), Vol. 13-02.

Editorial Work

Editors: Nalpantidis, Lazaros and Detry, Renaud and Damen, Dima and Bleser, Gabriele and Cakmak, Maya and Suphi Erden, Mustafa. Cognitive Robotics Systems: Concepts and Applications. Journal of Intelligent & Robotic Systems (June 2015). DOI: 10.1007/s10846-015-0244-9

Editors: Burghardt, Tilo and Damen, Dima and Mayol-Cuevas, Walterio and Mirmehdi, Majid. Correspondence, Matching and Recognition. International Journal of Computer Vision - Special Issue (May 2015) DOI:10.1007/s11263-015-0827-8

Editors: Burghardt, Tilo and Damen, Dima and Mayol-Cuevas, Walterio and Mirmehdi, Majid. Proceedings of the British Machine Vision Conference 2013. British Machine Vision Association (Bristol, September 2013).

Book Chapters

(2016) Woznowski et al. SPHERE: A Sensor Platform for Healthcare in a Residential Environment. Designing, Developing, and Facilitating Smart Cities. pdf

Theses

(2009) Activity Analysis: Finding Explanations for Sets of Events. PhD Thesis. University of Leeds pdf (6MB)

(2003) Visual Signature for Large Scale Tracking. MSc Thesis. University of Leeds pdf