Jacob Chalk

Research Associate - University of Bristol

I am a Research Associate at the University of Bristol and a member of the MaVi research group. My current research is on 4D video understanding, aiming to develop systems that can perceive and reason about dynamic 3D scenes over time.


Previously, as a PhD researcher in Computer Vision at the University of Bristol, supervised by Prof. Dima Damen, I focused on leveraging multimodal data for egocentric video understanding. This included topics such as audio-visual deep learning, action recognition and detection, predicting object interactions using eye gaze and 3D annotations, and long-term 3D multi-object tracking. During this time, I was also a PhD intern with the Visual Representation Learning team at NAVER Labs Europe.


Prior to my PhD, I earned a First Class Honours MEng in Computer Science from the University of Bristol, where my dissertation on "Video GANs for Human-Object Interactions" received a high grade. Alongside research, I've gained teaching experience across multiple undergraduate modules, contributing to both coursework design and lab-based support.


My technical strengths lie in deep learning, computer vision, and multimodal modelling, with extensive experience in Python (PyTorch) and working knowledge of C++ and JavaScript.

Latest News

Research

* denotes equal contribution

Prime and Reach

Prime and Reach: Synthesising Body Motion for Gaze-Primed Object Reach

Masashi Hatano*, Saptarshi Sinha*, Jacob Chalk, Wei-Hong Li, Hideo Saito, Dima Damen

arXiv preprint arXiv:2512.16456, 2025

HD-EPIC

HD-EPIC: A Highly-Detailed Egocentric Video Dataset

Toby Perrett*, Ahmad Darkhalil*, Saptarshi Sinha*, Omar Emara*, Sam Pollard*, Kranti Parida*, Kaiting Liu*, Prajwal Gatti*, Siddhant Bansal*, Kevin Flanagan*, Jacob Chalk*, Zhifan Zhu*, Rhodri Guerrier*, Fahd Abdelazim*, Bin Zhu, Davide Moltisanti, Michael Wray, Hazel Doughty, Dima Damen

Conference on Computer Vision and Pattern Recognition (CVPR), 2025

OSNOM

Spatial Cognition from Egocentric Video: Out of Sight, Not Out of Mind

Chiara Plizzari, Shubham Goel, Toby Perrett, Jacob Chalk, Angjoo Kanazawa, Dima Damen

International Conference on 3D Vision (3DV), 2025

TIM

TIM: A Time Interval Machine for Audio-Visual Action Recognition

Jacob Chalk*, Jaesung Huh*, Evangelos Kazakos, Andrew Zisserman, Dima Damen

Conference on Computer Vision and Pattern Recognition (CVPR), 2024

EPIC-Sounds

EPIC-Sounds: A Large-scale Dataset of Actions That Sound

Jaesung Huh*, Jacob Chalk*, Evangelos Kazakos, Dima Damen, Andrew Zisserman

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023

Experience

Honours & Awards

  • Outstanding Reviewer
    International Conference on Computer Vision (ICCV 2025)
    Conference on Computer Vision and Pattern Recognition (CVPR 2025)
  • Distinguished Paper Award (EPIC-Sounds)
    EgoVis 2022/23 · First Joint Egocentric Vision Workshop (CVPR 2024)
  • EPIC-KITCHENS Challenges Winner
    First Joint Egocentric Vision Workshop (CVPR 2024). Placed 2nd in both Audio-Based Interaction Recognition & Detection, and 3rd in Action Detection.
  • Top 5 Third Year MEng CS/CS+Maths Student
    Awarded by Netcraft (University of Bristol 2020)

Reviewing

  • Conference Reviewer
    CVPR '25, '26 · NeurIPS '25 · ICCV '25 · ECCV '24, '26 · ICPR '26 · BMVC '26
  • Journal Reviewer
    IEEE TPAMI · IJCV · IEEE OJSP

Teaching

Teaching Assistant

Presentations

  • TIM: A Time-Interval Machine
    Sight & Sound Workshop (CVPR 2024) · Twelve Labs Multimodal Weekly #56
  • EPIC-KITCHENS Challenges
    First Joint Egocentric Vision Workshop (CVPR 2024)
  • EPIC-Sounds
    Oral Presentation, ICASSP 2023