
Hongzhi Li


I’m a Principal Researcher and Research Manager in Microsoft AI & Research. My research interests are mainly in machine intelligence, including multimodal content analysis, knowledge extraction and representation, pattern recognition, and cloud-based computing. My current research focuses on deep learning for visual intelligence and its applications on cloud computing platforms.


Education

Columbia University, New York, US

Doctor of Philosophy (Ph.D.), Computer Science

Columbia University, New York, US

Master of Science (M.S.), Computer Science

Zhejiang University, Hangzhou, China

Bachelor of Engineering (B.E.), Computer Science

Research Projects

Personalized television news

Jul. 2012 – Sep. 2013 in DVMM lab, Columbia University

  • Bi-coastal research project with students from Stanford and Columbia University
  • In this project, we seek to develop and demonstrate a platform for personalized television news to replace the traditional one-broadcast-fits-all model. We forecast that next-generation video news consumption will be more personalized, device agnostic, and pooled from many different information sources.

Mobile Panorama View from Single Picture

Jul. 2010 – Mar. 2011 at Microsoft Research Asia

  • Generates a panorama view on a mobile device from a single photo.
  • Developed a distributed image retrieval system that gathers a collection of images related to the user's input photo; this collection is used to build the panorama view.
  • Parallel computation in the cloud generates the panorama view.
  • A predictive cache system speeds up the panorama viewer on the mobile client.

Mobile Experience Sharing through Automatic Multimedia Blogging

Mar. 2010 – Jun. 2010 at Microsoft Research Asia

  • A "mobile + cloud" system enabling rapid experience sharing through automatic blogging.
  • Based on multimodal media content analysis and synthesis.
  • A paper and a demo were published at ACM MM'10.

Image Based Tree Modeling and Growth Analysis

Jan. 2010 – Jun. 2010 (Bachelor's Thesis, Zhejiang University)

  • Estimates tree parameters (e.g., height and diameter) by building a 3D tree model from a video clip.
  • Uses 3D object surface reconstruction based on point clouds.
  • Builds the 3D branch model by manually tracking key points in the video.
  • Excellent Bachelor Thesis Award

Mobile Tour Guide

Jun. 2009 – Oct. 2009 at Microsoft Research Asia

  • A mobile application that gives users rich information about the sight they are visiting, drawing on a cloud database and data from several sensors in the mobile phone.



Publications

  • Wei Zhang, Hongzhi Li, Chong-Wah Ngo, and Shih-Fu Chang. Scalable Visual Instance Mining with Threads of Features. ACM Multimedia 2014, Orlando, Florida, Nov. 2014
  • Hongzhi Li*, Brendan Jou*, Joseph G. Ellis*, Daniel Morozoff*, and Shih-Fu Chang. News Rover: Exploring Topical Structures and Serendipity in Heterogeneous Multimedia News. ACM Multimedia 2013, Barcelona, Spain, Oct. 2013

  • Brendan Jou*, Hongzhi Li*, Joseph G. Ellis*, Daniel Morozoff*, and Shih-Fu Chang. Structured Exploration of Who, What, When, and Where in Heterogeneous Multimedia News Sources. ACM Multimedia 2013, Barcelona, Spain, Oct. 2013

  • Hongzhi Li and Wenwu Zhu. Mobile panorama view from single picture. SPIE, Applications of Digital Image Processing XXXVI, 2013. International Society for Optics and Photonics, Aug. 2013.

  • Brendan Jou*, Hongzhi Li*, Joseph G. Ellis*, Daniel Morozoff*, and Shih-Fu Chang. News Rover. Greater New York Multimedia & Vision (GNYMV) Workshop, 2013. (Best Demo Award)

  • Hongzhi Li, Xian-Sheng Hua, and Xijia Liu. Melog. In ACM Multimedia 2010, Firenze, Italy, Oct. 2010.

  • Hongzhi Li and Xian-Sheng Hua. Melog: Mobile Experience Sharing through Automatic Multimedia Blogging. In ACM Multimedia 2010 Workshop - Mobile Cloud Media Computing, Firenze, Italy, Oct. 2010.

Journal

  • Zhi Wang, Li-Feng Sun, Wenwu Zhu, Shi-Qiang Yang, Hongzhi Li, and Dapeng Oliver Wu. Joint Social and Content Recommendation for User Generated Videos in Online Social Network. IEEE Transactions on Multimedia, 2013.

  • Hao Yin, Wen Hui, Hongzhi Li, Chuang Lin and Wenwu Zhu. A Novel Large-Scale Digital Forensics Service Platform for Internet Videos. IEEE Transactions on Multimedia, vol. 14, no. 1, pp. 178-186, Feb. 2012.

  • Wenwu Zhu, Dan Miao, and Hongzhi Li. Real-time 3D Applications on Handheld Devices: Challenges and Trend. IEEE COMSOC MMTC E-Letter, vol. 6, no. 6, June 2011.

Patents

  • Wenwu Zhu, Zheng Li, Roberto R. Molinari, Hongzhi Li. Predictive, Multi-Layer Caching Architectures. US Patent App. 13/177,058, 2011. Assigned to Microsoft, Inc.

  • Xian-Sheng Hua, Hongzhi Li, Shipeng Li. Autonomous Mobile Blogging. US Patent App. 12/965,604, 2010. Assigned to Microsoft, Inc.

Work Experiences

  • Senior Researcher, Microsoft Research (Redmond)

    Aug. 2016

  • Research Intern, Microsoft Research (Redmond)

    Jun. 2015 – Aug. 2015

    Research Area: Distributed computing platform

  • Research Intern, Microsoft Research (Redmond)

    May 2014 – Aug. 2014

    Research Area: Distributed computing platform

  • Research Intern, Bell Laboratories

    Jun. 2012 – Aug. 2012

    Research Area: Human action recognition

  • Research Intern, Microsoft Research Asia

    Jul. 2009 – Jul. 2011

    Research Area: Multimedia retrieval, mobile computing, networking, audio mixing, large-scale content-based image retrieval

Academic Services

  • Technical Program Committee Member: ACM International Conference on Multimedia 2014; International Conference on Multimedia and Expo 2013, 2014

  • Journal Reviewer: IEEE Transactions on Multimedia, IEEE Transactions on Circuits and Systems for Video Technology

  • Conference Reviewer: ACM Multimedia 2012