Ph.D. Student

Machine Intelligence & Translation Lab

School of Computer Science and Technology

Harbin Institute of Technology


yumo AT mtlab DOT edu DOT cn


  • 2005.8 to 2009.7: B.S. in Computer Science and Technology, School of Honor, Harbin Institute of Technology.
  • 2009.9 to 2011.7: M.S. in Computer Science and Technology, School of Computer Science and Technology, Harbin Institute of Technology. Supervisor: Tie-Jun Zhao
  • 2011.9 until now: Ph.D. in Computer Science and Technology, School of Computer Science and Technology, Harbin Institute of Technology. Supervisor: Tie-Jun Zhao
  • Internship and Work Experience

    • The Center for Language and Speech Processing, Johns Hopkins University (2013.12 until now)

    I am currently a joint Ph.D. student at JHU, working on learning structured representations for NLP. My advisor is Prof. Mark Dredze.

    • Baidu Inc., Natural Language Processing Group (2013.7-2013.12)

    I was a research intern working on dependency parsing, learning syntactic representations, and online learning algorithms for structured prediction problems. Some of my work was advised by Prof. Tong Zhang.

    • Baidu Inc., Natural Language Processing Group (2012.5-2013.6)

    I was a research intern working on deep learning for NLP. I received the “excellent intern” award in December 2012.

    • Microsoft Research Asia, Natural Language Computing Group (2010.08 to 2010.12)

    I was hired as an intern to work on Twitter sentiment analysis, supervised by Long Jiang.

    • Microsoft Research Asia, Web Search & Mining Group (2008.12 to 2009.8)

    I was hired as an intern to work on intelligent advertising and re-ranking of image retrieval results, supervised by Dr. Xin-Jing Wang. My work earned me the “excellent certification”.

    Research interests

    My current interests are mainly machine learning, syntactic parsing, and their applications.

    • Learning Representations for NLP Problems

    Learning of syntactic representations

    Working on learning and applications of word representations and structured representations with syntactic information.

    Learning of word embedding and its applications

    Working on learning strategies for word embeddings. Applied word embeddings to NER, Chinese POS tagging, and cross-lingual projection tasks.

    • Syntactic Parsing and its Applications

    Online learning for dependency parsing

    Improving the online learning algorithms and distributed online learning algorithms for dependency parsing.

    Domain adaptation for dependency parsing

    Analyzed the performance of graph-based and transition-based dependency parsers and the domain differences in dependency parsing. Researched domain adaptation methods for dependency parsing.

    Target-dependent and Context-based Sentiment Analysis

    Researched target-dependent (based on information from dependency parsing) and context-based (based on conditional random fields) sentiment analysis for tweets.

    • Machine Learning and its Applications to NLP

    Online learning and its applications to NLP

    Semi-supervised learning and its applications to NLP

    Extended semi-supervised learning methods (e.g. co-training and label propagation) to handle structured prediction problems and applied the methods to cross-lingual projection and other NLP tasks. Carried out a theoretical analysis of the experimental results.

    • Image Retrieval

    Re-ranking of results of image retrieval based on Dirichlet Processes (DP)

    Studied the principles of DP and worked on multi-view image clustering with DP (using textual and visual information) and its application to image search re-ranking.

    Advertising based on users’ photo collections

    Extracted topics from images based on the results of image annotation and the text surrounding each image. Matched the images with advertisements in the topic space.


    Learning representations for NLP tasks:

  • Mo Yu, Mark Dredze. Improving Lexical Embeddings with Semantic Knowledge. Association for Computational Linguistics (ACL), 2014 short (accepted). [pdf] [code & data]
  • Mo Yu, Tiejun Zhao, Yalong Bai, Hao Tian, Dianhai Yu. Cross-lingual Projections between Languages from Different Families. ACL 2013 short paper. [pdf]
  • Mo Yu, Tiejun Zhao, Daxiang Dong, Hao Tian, Dianhai Yu: Compound Embedding Features for Semi-supervised Learning. NAACL 2013 short paper. [pdf]
  • Syntactic parsing and its applications:

  • Mo Yu, Tiejun Zhao, Yalong Bai. Learning Domain Differences Automatically for Dependency Parsing Adaptation. IJCAI 2013 poster. [pdf]
  • Long Jiang, Mo Yu, Ming Zhou, Xiaohua Liu, Tiejun Zhao: Target-dependent Twitter Sentiment Classification. ACL 2011: 151-160
  • Zhen Li, Hanjing Li, Mo Yu, Tiejun Zhao, Sheng Li: Event Entailment Extraction Based on EM Iteration. IALP 2010: 101-104
  • Machine learning and its applications to NLP:

  • Mo Yu, Tiejun Zhao, Penglong Hu. A Theoretical Analysis on Structured Learning with Noisy Data and its Applications. Journal of Software. 2013 (In Chinese). [pdf]
  • Lemao Liu, Hailong Cao, Taro Watanabe, Tiejun Zhao, Mo Yu, Conghui Zhu. Locally Training the Log-Linear Model for SMT. EMNLP-CoNLL 2012.
  • Huashen Liang, Lemao Liu, Mo Yu, Yupeng Liu, Penglong Hu, Tingting Li, Chunyue Zhang, Hailong Cao, Tiejun Zhao. Technical Report of the HIT Machine Intelligence & Translation Lab for CWMT 2011 (In Chinese)
  • Mo Yu, Shu Wang, Conghui Zhu, Tiejun Zhao: Semi-supervised learning for word sense disambiguation using parallel corpora. FSKD 2011: 1490-1494
  • PengLong Hu, Mo Yu, Jing Li, Conghui Zhu, Tiejun Zhao: Semi-supervised Learning Framework for Cross-Lingual Projection. Web Intelligence/IAT Workshops 2011: 213-216
  • Han Xi-Wu, Yu Mo, Zhu Cong-Hui, and Zhao Tie-Jun. A Sequence Kernel Method for Chinese Subcategorization Analysis. Chinese Journal of Electronics. Vol.19, No.3, July 2010.
  • Cong-Hui Zhu, Mo Yu, Tie-Jun Zhao. Chinese Word Segmenter Based on Discriminative Classifiers Integration. Journal of Computational Information Systems 3:5 (2008) 1-7
  • Image retrieval

  • Yu-Heng Ren, Mo Yu, Xin-Jing Wang, Lei Zhang, Wei-Ying Ma, Diversifying Landmark Image Search Results by Learning Interested Views from Community Photos. WWW2010 Proceedings
  • Xin-Jing Wang, Mo Yu, Lei Zhang, Wei-Ying Ma, Argo: Intelligent Advertising Made Possible from Users’ Photos (demo paper). ACM MM09 Proceedings.
  • Xin-Jing Wang, Mo Yu, Lei Zhang, Wei-Ying Ma. Advertising based on users’ photos. IEEE ICME 2009 workshop.
  • Xin-Jing Wang, Mo Yu, Lei Zhang, Rui Cai, Wei-Ying Ma. Argo: Intelligent Advertising by Mining a User’s Interest from His Photo Collections. ADKDD’09, June 28, 2009
  • Research Proposal & Funding

    With my advisor, I wrote a proposal to the National Natural Science Foundation of China on the subject “Cross-lingual projection based on semi-supervised structured learning”, which was funded (NSFC: 61173073) in 2012. I am now in charge of the work related to this funding in my lab.

    Skills & Expertise

    1. Machine Learning
    2. Natural Language Processing
    3. Deep Learning
    4. Semantic Similarity
    5. Cross-lingual Projection
    6. Sentiment Analysis
    7. Named Entity Recognition
    8. Online Learning
    9. Domain Adaptation
    10. Dependency Parsing
    11. Machine Translation
    12. Information Retrieval
    13. C/C++
    14. C#
    15. Perl
    16. Java
    17. Python
    18. MATLAB