In this talk, Dr. Jianquan Liu presents an industry perspective on the convergence of video analytics and generative AI. The talk begins with an overview of video analytics, covering advancements in action recognition, object tracking, human-object interactions, scene recognition, and behavioral pattern analysis. These technologies enable efficient extraction, retrieval, visualization, and summarization of video content. The presentation then explores the impact of generative AI, particularly large language models (LLMs), on video understanding. It discusses how LLMs enhance object recognition, semantic segmentation, action recognition, captioning, visual question answering, and storytelling. Dr. Liu provides industry case studies to illustrate these applications while also addressing limitations and challenges. The talk introduces NEC’s narrative summarization framework, designed to tackle key challenges in video analytics. It concludes with a demonstration of “Video with LLM” technology, showcasing its practical application in automating traffic accident investigation reports. This presentation offers valuable insights into the current state and future potential of AI-driven video intelligence, bridging the gap between technical innovation and practical application for both industry professionals and general audiences.
| Date | 25 November 2024 (Monday) |
| Time | 10:00 – 11:00 am |
| Venue | STEM Innovation Hub (C-LP-06) |
| Speaker | Dr. Jianquan Liu, Director & Senior Principal Researcher, NEC Corporation, Japan |
| Moderator | Dr. Yu Yang, Assistant Professor, Centre for Learning, Teaching and Technology, The Education University of Hong Kong |
| Language | English |
| Target | For all staff and students |
Participation in this seminar can be counted towards the Certificate Course “Introduction to Teaching in Higher Education” under the theme “Learning and Teaching Seminars/Workshops”.