Rui Yan (严 睿)

Ph.D., ACL Member, ACM Member

Assistant Professor, PhD Advisor(with Researcher Title)
Institute of Computer Science and Technology (ICST),
Peking University, Beijing 100871, China,
Beijing Institute of Big Data Research, Beijing 100080, China

Adjunct Professor (External Advisor),
School of Computer Science,
Central China Normal University
and School of Information,
Central University of Finance and Economics

Phone: (86)-10-82529049
Email: ruiyan AT pku DOT edu DOT cn


Dr. Rui Yan is now a tenure-track assistant professor with the researcher title in Institute of Computer Science and Technology (ICST), Peking University, working with Prof. Dongyan Zhao. He is also affiliated with Beijing Institute of Big Data Research (BIBDR), working with Prof. Weinan E. Before that, he was a senior researcher in Baidu Inc. (Beijing Headquarter), working with Hao Tian, Dr. Shiqi Zhao, Dr. Hua Wu, and Dr. Haifeng Wang. He has a secondary appointment as an Adjunct Professor and External Advisor in School of Computer Science, Central China Normal University and School of Information, Central University of Finance and Economics. In 2013, he obtained the doctorate degree from School of Electronics Engineering and Computer Science, Peking University, working with Prof. Xiaoming Li, Prof. Xiaojun Wan and Prof. Yan Zhang. He served as a research assistant in 1) University of Pennsylvania, working with Prof. Chris Callison-Burch; 2) National Taiwan University, working with Prof. Shou-De Lin and Pu-Jen Cheng; and 3) Tsinghua University, working with Prof. Jie Tang. He is also proud to have collaborated with Prof. Mirella Lapata from University of Edinburgh, Prof. Jian-Yun Nie from University of Montreal and Prof. Xiaohua (Tony) Hu from Drexel University.

Rui Yan has a broad interest in real world problems related to natural languages, text information, social networks, web application, scientific literature, and multimedia. Rui's research focuses on Natural Language Processing (Computational Linguistics), Information Retrieval, Machine Learning and Artificial Intelligence. More specifically, he is now conducting research into dialogue systems, natural language generation, text mining and knowledge management.

I'm always looking for highly self-motivated students to work with me as PhD and/or master students (ONE PhD position open for 2018 Fall). Candidates who are interested in working with me are welcome to send me your resume.

An important reminder for interns. Before you apply for internship in our research group, please be aware: we hold the intellectual property of any research work from our lab during your internship. We have every right to improve and CONTINUE the work even when the intern(s) in charge check out. AGREE with the terms and conditions before you send me emails and sign up. This is important.

Selected Publications (FULL LIST)

  • Zhenxin Fu, Xiaoye Tan, Nanyun Peng, Dongyan Zhao and Rui Yan. Style Transfer in Text: Exploration and Evaluation. In AAAI'18. Full paper. (CCF Rank A) [To Appear]

  • Chongyang Tao, Lili Mou, Dongyan Zhao and Rui Yan. RUBER: An Unsupervised Method for Automatic Evaluation of Open-Domain Dialog Systems. In AAAI'18. Full paper. (CCF Rank A) [To Appear]

  • Yiping Song, Rui Yan, Yansong Feng, Yaoyuan Zhang, Dongyan Zhao and Ming Zhang. Towards a Neural Conversation Model with Diversity Net Using Determinantal Point Processes. In AAAI'18. Full paper. (CCF Rank A) [To Appear]

  • Ying Zeng, Yansong Feng, Rong Ma, Zheng Wang, Rui Yan, Chongde Shi and Dongyan Zhao. Scale Up Event Extraction Learning via Automatic Training Data Generation. In AAAI'18. Full paper. (CCF Rank A) [To Appear]

  • Lili Yao, Yaoyuan Zhang, Yansong Feng, Dongyan Zhao and Rui Yan. Towards Implicit Content-Introducing for Generative Short-Text Conversation Systems. In EMNLP'17. Full paper. []

  • Rui Yan, Dongyan Zhao, and Weinan E. Joint Learning of Response Ranking and Next Utterance Suggestion in Human-Computer Conversation System. In SIGIR'17. Full paper. (CCF Rank A) []

  • Bingfeng Luo, Yansong Feng, Zheng Wang, Zhanxing Zhu, Songfang Huang, Rui Yan and Dongyan Zhao. Learning with Noise: Enhance Distantly Supervised Relation Extraction with Dynamic Transition Matrix. In ACL'17. Full paper. (CCF Rank A) []

  • Zhiliang Tian, Rui Yan, Lili Mou, Yiping Song, Yansong Feng and Dongyan Zhao. How to Make Contexts More Useful? An Empirical Study to Context-Aware Neural Conversation Models. In ACL'17. Short paper. (CCF Rank A) []

  • Rui Yan, Yiping Song, Cheng-Te Li, Ming Zhang and Xiaohua Hu. Opportunities or Risks to Reduce Labor Force in Crowdsourcing Translation? In CrowdML@NIPS'16 Workshop. (Invited Submission)

  • Lili Mou, Yiping Song, Rui Yan, Ge Li, Lu Zhang, Zhi Jin. Sequence to backward and forward sequences: A content-introducing approach to generative short-text conversation. In COLING'16, Full paper, 2016. []

  • Lili Mou, Zhao Meng, Rui Yan, Ge Li, Yan Xu, Lu Zhang, and Zhi Jin. How Transferable are Neural Networks in NLP Applications? In EMNLP'16, Full paper, 2016. []

  • Xiangyang Zhou, Daxiang Dong, Hua Wu, Shiqi Zhao, Dianhai Yu, Hao Tian, Xuan Liu, and Rui Yan. Multi-view Response Selection for Human-Computer Conversation. In EMNLP'16, Full paper, 2016. []

  • Rui Yan, Yiping Song, Xiangyang Zhou, and Hua Wu. "Shall I Be Your Chat Companion?" Towards an Online Human-Computer Conversation System. In CIKM'16, Full paper, 2016. []

  • Yiping Song, Lili Mou, Rui Yan, Li Yi, Zi'nan Zhu, Xiaohua Hu, and Ming Zhang. Dialogue Session Segmentation by Embedding-Enhanced TextTiling. In INTERSPEECH'16, Full paper, 2016. []

  • Rui Yan, Cheng-Te Li, Xiaohua Hu, and Ming Zhang. Chinese Couplet Generation with Neural Network Structures. In ACL'16. Full paper, 2016. (CCF Rank A) []

  • Lingfei Wu, Ian E.-H. Yen, Jie Chen, and Rui Yan. Revisiting Random Binning Feature: Fast Convergence and Strong Parallelizability. In KDD'16. Full paper, 2016. (CCF Rank A) (Plenary Presentation) []

  • Lili Mou, Rui Men, Ge Li, Yan Xu, Lu Zhang, Rui Yan, and Zhi Jin. Natural Language Inference by Tree-Based Convolution and Heuristic Matching. In ACL'16. Short paper, 2016. (CCF Rank A) []

  • Rui Yan, Yiping Song and Hua Wu. Learning to Respond with Deep Neural Networks for Retrieval based Human-Computer Conversation System. In SIGIR'16. Full paper, pp. 55-64, 2016. (AC Rate = 18%, CCF Rank A) []

  • Rui Yan. i, Poet: Automatic Poetry Composition through Recurrent Neural Networks with Iterative Polishing Schema. In IJCAI'16. Full paper, pp. 2238-2244, 2016. (AC Rate = 25%, CCF Rank A) []

  • Xiang Li, Lili Mou, Rui Yan, and Ming Zhang. StalemateBreaker: A Proactive Content Introducing Approach for Automatic Human-Computer Conversation. In IJCAI'16. Full paper, pp. 2845-2851, 2016. (AC Rate = 25%, CCF Rank A) []
    Our preprint received UK DailyMail, The Stack, Headline Today, China Science, Peking University News and serveral other news media coverage within only 3 days after got published on arXiv. ‪#‎Go_ChatBot!‬

  • Rui Yan, Cheng-Te Li, Hsun-Ping Hsieh, Po Hu, Xiaohua Hu, and Tingting He. Socialized Language Model Smoothing via Bi-directional Influence Propagation on Social Networks. In WWW'16. Full paper, pp.1395-1405, 2016. (AC Rate = 16%, CCF Rank A) []

  • Lili Mou, Rui Yan, Ge Li, Lu Zhang, and Zhi Jin. Backbone Language Modeling for Constrained Sentence Generation. In arXiv. Under review. arXiv 1512.06612 (preprint). []

  • Wanying Ding, Yue Shang, Lifan Guo, Xiaohua Hu, Rui Yan, and Tingting He. Video Popularity Prediction by Sentiment Propagation via Implicit Network. In CIKM'15. Full paper, pp. 1621-1630, 2015. (AC Rate = 17.9%) []

  • Hsun-Ping Hsieh, Rui Yan, and Cheng-Te Li. Where You Go Reveals Who You Know: Analyzing Social Ties from Millions of Footprints. In CIKM'15. Short paper, pp.1839-1842, 2015. (123/484, AC Rate = 25%) []

  • Rui Yan, Yiping Song, Cheng-Te Li, Ming Zhang and Xiaohua Hu. Opportunities or Risks to Reduce Labor Force in Crowdsourcing Translation? Characterizing Cost v.s. Quality in Balance. In IJCAI'15. Full paper, pp. 1025-1032, 2015. (575/1996, AC Rate = 28.8%, CCF Rank A) []

  • Hsun-Ping Hsieh, Cheng-Te Li, and Rui Yan. I See You: Person-of-Interest Search in Social Networks. In SIGIR'15. Short paper, pp. 839-842, 2015. (79/252, AC Rate = 31.3%, CCF Rank A) []

  • Hsun-Ping Hsieh, Rui Yan and Cheng-Te Li. Dissecting Urban Noises from Heterogeneous Sensor Data on Geo-Social Media. In MM'15. Short paper, 2015. (136/386, AC Rate = 35.2%, CCF Rank A) [camera-ready coming soon]

  • Rui Yan, Xiang Li, Mengwen Liu, and Xiaohua Hu. Tackling Sparsity, the Achilles Heel of Social Networks: Language Model Smoothing via Social Regularization. In ACL-IJCNLP'15. Short paper, pp. 623-629, 2015. (AC Rate = 22.3%, CCF Rank A) []

  • Rui Yan, Ian E.-H. Yen, Cheng-Te Li, Shiqi Zhao and Xiaohua Hu. Tackling the Achilles Heel of Social Networks: Influence Propagation based Language Model Smoothing. In WWW'15. Full paper, pp. 1318-1328, 2015. (131/929, AC Rate = 14.1%, CCF Rank A) []

  • Rui Yan, Mingkun Gao, Ellie Pavlick, and Chris Callison-Burch. Are Two Heads Better than One? Crowdsourcing Translation via a Two-Step Non-Professional Collaboration between Translators and Editors. In ACL'14. Full paper, pp. 1134-1144, 2014. (146/572, AC Rate = 26.2%, CCF Rank A) []

  • Yu-Yang Huang, Rui Yan, Tsung-Ting Kuo, and Shou-De Lin. Enriching Cold Start Personalized Language Model Using Social Network Information. In ACL'14. Short paper, 2014. (139/551, AC Rate = 26.1%, CCF Rank A) []

  • Ellie Pavlick, Rui Yan, and Chris Callison-Burch. Crowdsourcing for Grammatical Error Correction. In CSCW'14. Short paper, pp. 209-213, 2014. (134/497, AC Rate = 27%, CCF Rank A)[]

  • Rui Yan, Han Jiang, Mirella Lapata, Shou-De Lin, Xueqiang Lv, and Xiaoming Li. i, Poet: Automatic Chinese Poetry Composition through a Generative Summarization Framework under Constrained Optimization. In IJCAI'13. Full paper, pp. 2197-2203, 2013. (413/1473, AC Rate = 28%, CCF Rank A) []

  • Tsung-Ting Kuo, Rui Yan, Yu-Yang Huang, Perng-Hwa Kung and Shou-De Lin. Unsupervised Link Prediction using Aggregative Statistics on Heterogeneous Social Networks. In KDD’13. Full paper, pp. 775-783, 2013. (125/726, AC Rate = 17%, CCF Rank A) []

  • Wayne Xin Zhao, Yanwei Guo, Rui Yan, Yulan He, and Xiaoming Li. Timeline generation with social attention. In SIGIR’13. Short paper, pp. 1061-1064, 2013. (85/250, AC Rate = 34%, CCF Rank A) []

  • Rui Yan, Mirella Lapata, and Xiaoming Li. Tweet Recommendation with Graph Co-Ranking. In ACL'12. Full paper, pp. 516-525, 2012. (111/571, AC Rate = 19%, CCF Rank A) []

  • Rui Yan, Congrui Huang, Jie Tang, Yan Zhang and Xiaoming Li. To Better Stand on the Shoulder of Giants. In JCDL’12. Full paper, pp. 51-60, 2012. (26/202, AC Rate = 12.9%) (Best Student Paper Award Nomination) []

  • Rui Yan, Xiaojun Wan, Mirella Lapata, Wayne Xin Zhao, Pu-Jen Cheng, and Xiaoming Li. Visualizing Timelines: Evolutionary Summarization via Iterative Reinforcement between Text and Image Streams. In CIKM'12. Full paper, pp. 275-284, 2012. (146/1088, AC Rate = 13.4%) []

  • Liang Kong, Shan Jiang, Rui Yan, Shize Xu, and Xiaoming Li. Ranking News Event by Influence Decay and Information Fusion for Media and Users. In CIKM'12. Short paper, pp. 1849-1853, 2012. (157/1088, AC Rate = 14.3%) []

  • Rui Yan, Xiaojun Wan, Jahna Otterbacher, Liang Kong, Xiaoming Li and Yan Zhang. Evolutionary Timeline Summarization: a Balanced Optimization Framework via Iterative Substitution. In SIGIR'11. Full paper, pp. 745-754, 2011. (108/543, AC Rate = 19.9%, CCF Rank A) []

  • Rui Yan, Jie Tang, Xiaobing Liu, Dongdong Shan, and Xiaoming Li. Citation Count Prediction: Learning to Estimate Future Citations for Literature. In CIKM'11. Short paper, pp. 1247-1252, 2011. (183/917, AC Rate = 20%) []

  • Dongdong Shan, Wayne Xin Zhao, Jing He, Rui Yan, Hongfei Yan, and Xiaoming Li. Efficient Phrase Querying with Flat Position Index. In CIKM'11. Short paper, pp. 2001-2004, 2011. (183/917, AC Rate = 20%) []

  • Rui Yan, Liang Kong, Yu Li, Yan Zhang and Xiaoming Li. A Fine-Grained Digestion of News Webpages through Event Snippet Extraction. In WWW'11. Short paper, pp. 157-158, 2011. (62/202, AC Rate = 30%, CCF Rank A) []

  • Rui Yan, Jian-Yun Nie, and Xiaoming Li. Summarize What You Are Interested In: An Optimization Framework for Interactive Personalized Summarization. In EMNLP'11. Full paper, pp. 1342-1351, 2011. (149/628, AC Rate = 24%) (Best Paper Award Nomination, Plenary Presentation.) []

  • Rui Yan, Liang Kong, Congrui Huang, Xiaojun Wan, Xiaoming Li, Yan Zhang. Timeline Generation through Evolutionary Trans-Temporal Summarization. In EMNLP'11. Full paper, pp. 433-443, 2011.(149/628, AC Rate = 24%) []

Selected Honors & Awards

  • Awarded as Municipal Excellent Graduate in Beijing city, 2013. (Out of Thousands)
  • Awarded Google China Ph.D. Fellowship in Text Mining, 2012. (1 of 4 Recipients, Greater China Area)
  • Awarded as “Wu-Si” Medalist, University Golden Medal, 2012. (Highest honor in Peking University)
  • Student Travel Award of IJCAI'13, ACL'12, and CIKM'12, 2012-2013.
  • Awarded as one of the Top-10 Student Researcher Pivots, Peking University, 2012. (Out of hundreds)
  • Awarded by IBM-CSC China Scholarship, 2011. (The only graduate recipient in Peking University)
  • Awarded by Peking University - Sohu Scholarship, 2011.
  • Awarded by MediaTek Fellowship, 2011. (1 of 8 recipients across China)
  • Awarded as Pivot of Merit Students, Peking University, 2011. (First-class honor for students)
  • Awarded by Peking University - Morgan Stanley Scholarship, 2010. (1st Prize, Top 1 out of 67)

Academic Services

Mentee Students

  • Miao Fan, 2014 (Ph.D. student in Tsinghua University, Baidu Research Intern)
  • Xiang Li, 2015 (Ph.D. student in Peking University, Baidu Research Intern)
  • Yuqi Li, 2015 (Undergraduate in Peking University -> Master student in Peking University, Baidu Research Intern)
  • Yiping Song, 2015 (Undergraduate in Nanjing University -> Ph.D. student in Peking University, Baidu Research Intern)
  • Ruobing Xie, 2016 (Master student in Tsinghua University, Baidu Research Intern)
  • Mengwen Liu, 2015 (Ph.D. student in Drexel University)
  • Zi'nan Zhu, 2015 (Master student in Central China Normal University)
  • Li Yi, 2015, co-advised with Prof. Xianjun Shen, (Master student in Central China Normal University)
  • Dajun Xiao, 2015 (Master student in Central China Normal University)
  • Yang Liu, 2015 (Master student in Central China Normal University)
  • Lili Mou, 2015 (Ph.D. student in Peking University)
  • Zhao Zhang, 2016 (Master student in Peking University)
  • Chenguang Wang, 2016 (Ph.D. student in Peking University -> IBM Research Almaden)
  • Jun Yin, 2016 (Ph.D. student in Peking University)
  • Weizheng Chen, 2016 (Ph.D. student in Peking University)
  • Chongyang Tao, 2016 (Ph.D. student in Peking University)
  • Lili Yao, 2016 (Master student in Peking University)
  • Juntao Li, 2016 (Ph.D. student in Peking University)
  • Mingyue Shang, 2016 (Master student in Peking University)
  • Ning Miao, 2016 (Master student in Peking University)
  • Yize Xie, 2016 (Undergraduate student in Peking University)
  • Zhenxin Fu, 2016 (Undergraduate student in Peking University)
  • Lisong Qiu, 2016 (Undergraduate student in Peking University)
  • Xiaoye Tan, 2016 (Graduate student in Peking University)
  • Wenpeng Hu, 2017 (Ph.D. student in Peking University)
  • Ruijian Xu, 2017 (Undergraduate student in Peking University)

Last Updated: August, 2017