
About
I am a postgraduate researcher at the University of Bristol, supervised by Prof. Dima Damen and a member of MaVi. I am studying multi-model video understanding, particularly in egocentric videos. The areas of research I express particular interests in are: audio-visual deep learning, egocentric video understanding (action recognition/detection), 3D Eye-Gaze Priming. Currently, I am a PhD Intern at Naver Labs Europe as a member of the Visual Representation Learning team.
I completed my MEng in Computer Science at the University of Bristol, achieving a first class honours, where my dissertation: “Video GANs for Human-Object Interactions” scored highly. Alongside my research, I have experience in teaching assistance for University modules I performed well in.
I have experience with many programming languages, such as: C++, C#, Javascript, Python and Flutter with larger experience and proficiency in Python, C++ and Javascript.
Funded by the Engineering and Physical Sciences Research Council (EPSRC).
Email: jacob.chalk@bristol.ac.uk
News
- February 2025 - New Role: Started as a PhD Intern at Naver Labs Europe!
- February 2025 - New Dataset: HD-EPIC has been publically released and accepted to CVPR 2025!
- January 2025 - Code Released: OSNOM code and camera-ready paper have been released!
- November 2024 - New Paper: OSNOM paper published to arXiv and accepted to 3DV 2025!
- September 2024 - Journal Paper: EPIC-Sounds Journal Extended version is now available on arXiv!
- April 2024 - New Paper: TIM paper and code have been released and accepted to CVPR 2024!
- February 2023 - Paper Accepted: EPIC-Sounds has been accepted to ICASSP 2023!
- January 2023 - New Dataset: EPIC-Sounds has been publically released!
- September 2021 - New Role: Started as a Postgraduate Researcher with Prof. Dima Damen!
Research
Current list of all research projects:
![]() |
HD-EPIC: A Highly-Detailed Egocentric Video Dataset
Toby Perrett*, Ahmad Darkhalil*, Saptarshi Sinha*, Omar Emara*, Sam Pollard*, Kranti Parida*, Kaiting Liu*, Prajwal Gatti*, Siddhant Bansal*, Kevin Flanagan*, , Zhifan Zhu*, Rhodri Guerrier*, Fahd Abdelazim*, Bin Zhu, Davide Moltisanti, Michael Wray, Hazel Doughty, Dima Damen *: Equal Contribution Conference on Computer Vision and Pattern Recognition (CVPR), 2025 [Webpage] [arXiv] [Code] |
![]() |
Spatial Cognition from Egocentric Video: Out of Sight, Not Out of Mind
Chiara Plizzari, Shubham Goel, Toby Perrett, , Angjoo Kanazawa, Dima Damen International Conference on 3D Vision (3DV), 2025 [Webpage] [arXiv] [Code] |
![]() |
TIM: A Time Interval Machine for Audio-Visual Action Recognition
, Jaesung Huh*, Evangelos Kazakos, Andrew Zisserman, Dima Damen *: Equal Contribution Conference on Computer Vision and Pattern Recognition (CVPR), 2024 [Webpage] [arXiv] [Code] |
![]() |
EPIC-Sounds: A Large-scale Dataset of Actions That Sound
Jaesung Huh*, , Evangelos Kazakos, Dima Damen, Andrew Zisserman *: Equal Contribution IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023 [Webpage] [arXiv] [Code] |
Teaching
- Teaching Assistant
-
- Applied Deep Learning - 21/22, 22/23, 23/24, 24/25. Webpage.
- Image Processing and Computer Vision - 23/24. Webpage.
- Computer Graphics - 20/21, 21/22, 23/24. Webpage.
- Team Project - 20/21 Webpage.
- Software Engineering Product - 19/20, 20/21. Webpage.
Miscellaneous
Presentations
- EPIC-KITCHENS Challenges - First Joint Egocentric Vision Workshop (CVPR 2024)
- EPIC-Sounds Oral Presentation - ICASSP 2023
Conference Reviewer
- International Conference on Computer Vision (ICCV) - 2025
- Conference on Computer Vision and Pattern Recognition (CVPR) - 2025
- European Conference on Computer Vision (ECCV) - 2024
Journal Reviewer
- IEEE Open Journal of Signal Processing (OJSP) - 2025
- International Journal of Computer Vision (IJCV) - 2025
- IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) - 2024
Honours and Awwards
- EgoVis 2022/23 Distinguished Paper Awards - First Joint Egocentric Vision Workshop (CVPR 2024)
- EPIC-KITCHENS Challenges Winner - First Joint Egocentric Vision Workshop (CVPR 2024)
-
- Audio-Based Interaction Recognition (2nd)
- Audio-Based Interaction Detection (2nd)
- Action Detection (3rd)
- Top 5 Third Year MEng Computer Science/Computer Science with Maths - Awarded by Netcraft (University of Bristol)