Hieu Hoang
I am currently a Senior Research Scientist at Microsoft, working on machine translation.
Contact Information
Email: FirstnameLastname@gmail.com
Skype: hieuhoang
GitHub
Google Scholar
News
- October 2019 Senior Research Scientist at Microsoft
- May 2018 Postgrad at the University of Edinburgh
- May 2017 Visiting researcher at the Alan Turing Institute
- October 2015 Independent researcher working on fast statistical and neural MT
- April 2015 Postgrad at New York University, Abu Dhabi
- April 2012 Postgrad at the University of Edinburgh
Publications
- On-the-Fly Fusion of Large Language Models and Machine Translation by H Hoang, H Khayrallah, M Junczys-Dowmunt. Proceedings of the North American Chapter of the Association for Computational Linguistics (NAACL), 2024
- Revisiting Locality Sensitive Hashing for Vocabulary Selection in Fast Neural Machine Translation by H Hoang, M Junczys-Dowmunt, R Grundkiewicz, H Khayrallah. Proceedings of the Seventh Conference on Machine Translation (WMT), 855-869, 2022
- ParaCrawl: Web-Scale Acquisition of Parallel Corpora by Marta Bañón, Pinzhen Chen, Barry Haddow, Kenneth Heafield, Hieu Hoang, Miquel Esplà-Gomis, Mikel L Forcada, Amir Kamran, Faheem Kirefu, Philipp Koehn, Sergio Ortiz Rojas, Leopoldo Pla Sempere, Gema Ramírez-Sánchez, Elsa Sarrías, Marek Strelec, Brian Thompson, William Waites, Dion Wiggins, Jaume Zaragoza. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
- ParaCrawl: Web-scale parallel corpora for the languages of the EU by Miquel Esplà, Mikel Forcada, Gema Ramírez-Sánchez, Hieu Hoang. MT Summit 2019
- Marian: Cost-effective High-Quality Neural Machine Translation in C++ by Marcin Junczys-Dowmunt, Kenneth Heafield, Hieu Hoang, Roman Grundkiewicz, Anthony Aue. WNMT 2018
- Fast Neural Machine Translation Implementation by Hieu Hoang, Tomasz Dwojak, Rihards Krislauks, Daniel Torregrosa, Kenneth Heafield. WNMT 2018
- Marian: Fast neural machine translation in C++ by Marcin Junczys-Dowmunt, Roman Grundkiewicz, Tomasz Dwojak, Hieu Hoang, Kenneth Heafield, Tom Neckermann, Frank Seide, Ulrich Germann, Alham Fikri Aji, Nikolay Bogoychev, Andre Martins, Alexandra Birch. Arxiv 2018
- A parallel corpus for evaluating machine translation between Arabic and European languages by Nizar Habash, Nasser Zalmout, Dima Taji, Hieu Hoang, Maverick Alzate. EACL 2017
- Exploring Hyper-Parameter Optimization for Neural Machine Translation on GPU Architectures by Robert Lim, Kenneth Heafield, Hieu Hoang, Mark Briers, Allen Malony. Arxiv 2018
- Fast, Scalable Phrase-Based SMT Decoding by Hieu Hoang, Nikolay Bogoychev, Lane Schwartz and Marcin Junczys-Dowmunt. AMTA 2016
- Fast and highly parallelizable phrase table for statistical machine translation by Nikolay Bogoychev and Hieu Hoang. WMT 2016
- Is neural machine translation ready for deployment by Marcin Junczys-Dowmunt, Tomasz Dwojak, Hieu Hoang. IWSLT 2016
- Can Markov Models Over Minimal Translation Units Help Phrase-Based? by Nadir Durrani, Alex Fraser, Helmut Schmid, Hieu Hoang, Philipp Koehn. ACL 2013
- Left Language Model State for Syntactic Machine Translation, Kenneth Heafield, Hieu Hoang, Philipp Koehn, Tetsuo Kiso and Marcello Federico, International Workshop on Spoken Language Translation (IWSLT), 2011.
- Factored Translation Models by Philipp Koehn and Hieu Hoang, Chapter in Handbook of Natural Language Processing and Machine Translation, editors Olive, Christianson, and McCary, Springer, 2011.
- More Linguistic Annotation for Statistical Machine Translation, Philipp Koehn, Barry Haddow, Philip Williams and Hieu Hoang, Fifth Workshop on Statistical Machine Translation and MetricsMATR, 2010.
- Improved Translation with Source Syntax Labels, Hieu Hoang and Philipp Koehn, Fifth Workshop on Statistical Machine Translation and MetricsMATR, 2010.
- A Uniform Framework for Phrase-Based, Hierarchical and Syntax-Based Machine Translation, Hieu Hoang, Philipp Koehn and Adam Lopez, International Workshop on Spoken Language Translation (IWSLT), 2009.
- Improving Mid-Range Re-Ordering using Templates of Factors, Hieu Hoang and Philipp Koehn, EACL 2009.
- A Systematic Analysis of Translation Model Search Spaces, Michael Auli, Adam Lopez, Hieu Hoang and Philipp Koehn, EACL Workshop on Statistical Machine Translation 2009.
- Design of the Moses Decoder for Statistical Machine Translation, Hieu Hoang and Philipp Koehn, ACL Workshop on Software engineering, testing, and quality assurance for NLP 2008.
- Improving Interactive Machine Translation via Mouse Actions, Germán Sanchis-Trilles, Daniel Ortiz-Martínez, Jorge Civera, Francisco Casacuberta, Enrique Vidal and Hieu Hoang, EMNLP 2008.
- Towards better Machine Translation Quality for the German-English Language Pairs, Philipp Koehn, Abhishek Arun and Hieu Hoang, ACL Workshop on Statistical Machine Translation 2008.
- Factored Translation Models, Philipp Koehn and Hieu Hoang, EMNLP 2007.
- Moses: Open Source Toolkit for Statistical Machine Translation, Philipp Koehn, Hieu Hoang, Alexandra Birch, Chris Callison-Burch, Marcello Federico, Nicola Bertoldi, Brooke Cowan, Wade Shen, Christine Moran, Richard Zens, Chris Dyer, Ondrej Bojar, Alexandra Constantin, Evan Herbst, ACL 2007, demonstration session.
- My thesis. 5 years in the making!