Photo

Quan Kong

Staff Research Scientist
Woven by Toyota, Inc.
Email: quan.kong [at] woven-planet (dot) global

Google Scholar

Research Topics
  • Vision-Language Models
  • Video Understanding
  • Multi-Modal Perception
  • Self-Supervised Learning

I am a staff research scientist at Woven by Toyota, Inc. working on computer vision. My concentric is about Machine Learning and the usage of it on Computer Vision, Large Language Models and Multi-Modal Perception. Before working at Woven by Toyota, I was a senior researcher of Hitachi, Ltd. R&D Japan working on large scale surveillance video analysis system, and a visiting researcher of Department of DBI at ATR working on home automation. I finished my Ph.D. and M.S. at Osaka University, advised by Takuya Maekawa, Norihisa Komoda and Yasuyuki Matsushita, and my undergraduate degrees at Xi'an Jiao Tong University.

News

More...

Projects

  • Multi-Modal Large Language Models for Industry Video Understanding & Agent Applications
  • Human-Centered Perception for City
  • Human Video Action Understanding for VCA & VSaaS
  • Large Scale Surveillance Video Analysis System
  • Patent Drawing Retrieval System
  • Image Classification/Segmentation for Automatic X-ray Baggage Screening System
  • Real World Context Recognition and Its Application for Supporting Interaction in Smart Environment (Ph.D. Thesis)

Professional Service

  • Chief research scientist for NEDO GENIAC Project
  • IEEE TIP, TPAMI, CVPR, ICCV, ECCV, WACV, AAAI, ICLR, ICMR, IJCAI PC member & Reviewer
  • IPSJ JIP (Journal of Information Processing) Editorial Board Member
  • IPSJ SIGUBI Committee Member

Publications (show selected / show all )