Welcome to my homepage ^_^
My home country is Vietnam, the S-shaped easternmost area on the Indochina Peninsula in Southeast Asia. My B.S degree was awarded by the National University of Singapore in 2009. During my time in Singapore, I was fortunate to be affiliated with the two research groups: WING group under Prof. Min-Yen Kan (Feb 2009 - Oct 2010), and Speech group under Prof. Khe Chai Sim (Nov 2010 - Aug 2011).
- Apr 2013 — I've recently wrapped up my main work on word representation learning (in submission) with Richard Socher and Prof. Christopher Manning.
- Sep 2012 — I spent a Fall quarter at Macquarie University, Australia, working with Prof. Mark Johnson. We got a paper provisionally accepted at TACL (in revision) on "Parsing entire discourses as very long strings: capturing topic continuity in grounded language learning", together with Prof. Michael C. Frank.
- Apr 2012 — This Winter quarter, I am officially aligned with Prof. Christopher Manning, i.e., my advisor!
- Jan 2012 — Happy new year, everyone! This Spring quarter, I am back to Gates, joining Prof. Christopher Manning's NLP group as part of the research rotation program. I have some research agenda in mind, but will hold the details for a while. Stay tuned!.
- Sep 2011 — This Fall quarter, I am having a research rotation with Prof. Noah D. Goodman, exploring computational models for human reading time prediction.
- Aug 2011 — Paper accepted at ASRU'2011: A Trajectory-based Parallel Model Combination with a Unified Static and Dynamic Parameter Compensation For Noisy Speech Recognition. Khe Chai Sim and Minh-Thang Luong.
- Apr 2011 — I will be joining the Stanford CS department this Fall as a PhD candidate (Sep 2011).
I primarily interest in Computational Linguistics, specifically in the area of Machine Translation and Morphology – enhancing translation for morphologically rich languages.
Previously, in WING group, I worked extensively in many scholarly Digital Library projects – document structure discovery and keyphrase extraction. My recent work under the speech group investigates in model-based techniques for robust Speech Recognition systems.
Here is my CV.
- Khe Chai Sim and Minh-Thang Luong. 2011. A Trajectory-based Parallel Model Combination with a Unified Static and Dynamic Parameter Compensation For Noisy Speech Recognition. Accepted in Automatic Speech Recognition and Understanding (ASRU’11). Hawaii, USA. Oral presentation.
[ .pdf ] [ Poster (.pdf) ]
- Minh-Thang Luong, Thuy Dung Nguyen, and Min-Yen Kan. 2011. Logical Structure Recovery in Scholarly Articles with Rich Document Features. International Journal of Digital Library Systems (IJDLS) 1(4). pp 1-23. Invited paper.
[ .pdf ] [ doi:10.4018/jdls.2010100101 ]
- Minh-Thang Luong, Preslav Nakov and Min-Yen Kan. 2010. A Hybrid Morpheme-Word Representation for Machine Translation of Morphologically Rich Languages. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP’10). Boston, Massachusetts, USA. pp.148-157. Oral presentation.
[ .pdf ] [ ACL Anthology (D10-1015) ]
- Minh-Thang Luong and Min-Yen Kan. 2010. Enhancing Morphological Alignment for Translating Highly Inflected Languages. In Proceedings of the International Conference on Computational Linguistics (COLING’10). Beijing, China. pp. 743-751. Oral presentation.
[ .pdf ] [ ACL Anthology (C10-1084) ]
- Thuy Dung Nguyen and Minh-Thang Luong. 2010. WINGNUS: Keyphrase Extraction Utilizing Document Logical Structure. In Proceedings of the ACL 2010 Workshop on Evaluation Exercises on Semantic Evaluation (SemEval-2). pp. 166-169. Poster presentation.
Rank 3rd in the task #5 – Automatic Keyphrase Extraction from Scientific Articles.
[ .pdf ] [ ACL Anthology (S10-1035) ]
- Thuy Dung Nguyen, Min-Yen Kan, Dinh-Trung Dang, Markus Hänse, Ching Hoi Andy Hong, Minh-Thang Luong, Jesse Prabawa Gozali, Kazunari Sugiyama and Yee Fan Tan (2010). ForeCite: Towards a Reader-centric Scholarly Digital Library. In Proceedings of the Joint Conference on Digital Libraries (JCDL’10). Poster Paper (1 page).
[ .pdf ] [ Poster (.png) ] [ doi:10.1145/1816123.1816193 ]
SoftwareBelow are some links to the demos of software written by myself or with members in WING group:
- ParsCit: An open-source CRF Reference String and Logical Document Structure Parsing Package
- GSNAP: Google Scholar - Network Analysis Package
- ForeCite: A Reader-centric Digital Library
NotesThis is my latest notes, containing all tips/commands/notes about various tools and languages. They are arranged in structures so that I could write a script later to split into separate files, one per language or tool ... For now, sorry for such a mess ...
Below are my notes collected randomly while I were learning some new tools and stuffs. To be organized:) :
See more in my academic page!