Lifeng Han
Lecturer
- Name: Dr. L. Han
- Telephone: 071 5272727
- E-mail: l.han@liacs.leidenuniv.nl
- ORCID iD: 0000-0002-3221-2185
Lifeng received his PhD in Machine Translation in Dublin, Ireland, with a thesis titled “An investigation into multi-word expressions in machine translation”.
More information about Lifeng Han
He did his first postdoctoral research project at the University of Manchester on NLP for digital healthcare, “Integrating hospital outpatient letters into the healthcare data space”, where he helped build models and supervise students on tasks including medication extraction, relation extraction, text simplification, entity linking, de-identification, synthetic data generation, and machine translation.
His current research is with the EU 4D Picture project, whose overall aim is “to improve the cancer patient journey and ensure personal preferences are respected.”
He was Co-chair of the Workshop on Multiword Expressions (MWEs) in 2023/24, a long-standing workshop held with ACL since 2003. He gave a tutorial at the main conference of LREC (Language Resources and Evaluation), one of the largest NLP conferences, in 2022 in Marseille, France, on “Meta-Evaluation of Translation Evaluation Methods: a systematic up-to-date overview”.
He holds an honorary position with the University of Manchester.
Lecturer
- Faculty of Science
- Leiden Institute of Advanced Computer Science
- Shaji D., Paul A., Han L., Del-Pinto W., Nenadic G. & Verberne S. (2025), De-identifying Clinical Texts using Biomed-Clinical BERTs and Comprehensive Risk Assessment, 2025 IEEE 13th International Conference on Healthcare Informatics (ICHI), 18 June 2025 - 21 June 2025: IEEE. 683-684.
- Belkadi S., Ren L., Micheletti N., Han L. & Nenadic G. (2025), MLM4SynMed: Masked Language Modelling for Synthetic Free-text Medical Records Generation, 2025 IEEE 13th International Conference on Healthcare Informatics (ICHI), 18 June 2025 - 21 June 2025: IEEE. 531-542.
- Romero P., Han L. & Nenadic G. (2025), INSIGHTBUDDY-AI: Medication Extraction and Entity Linking using Pre-Trained Language Models and Ensemble Learning. Ebrahimi A., Haider S., Liu E., Halder S., Leonor Pacheco M. & Wein S. (Eds.), Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 4: Student Research Workshop). 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies 29 April 2025 - 4 May 2025: Association for Computational Linguistics. 18-27.
- Ren L., Belkadi S., Han L., Del-Pinto W. & Nenadic G. (2025), Synthetic4Health: generating annotated synthetic clinical letters, Frontiers in Digital Health 7: 1497130.
- Romero P., Han L. & Nenadic G. (2025), Medication Extraction and Entity Linking using Stacked and Voted Ensembles on LLMs. Ananiadou S., Demner-Fushman D., Gupta D. & Thompson P. (Eds.), Proceedings of the Second Workshop on Patient-Oriented Language Processing (CL4Health). Second Workshop on Patient-Oriented Language Processing (CL4Health) 4 May 2025 - 4 May 2025. New York, U.S.A.: Association for Computational Linguistics. 303-315.
- Ren L., Ng Y. M. & Han L. (2025), MaLei at MultiClinSUM: Summarisation of Clinical Documents using Perspective-Aware Iterative Self-Prompting with LLMs: Notebook for the Lab at CLEF 2025, CLEF 2025 Working Notes, 9 – 12 September 2025, Madrid, Spain: Notebook for the Lab at CLEF 2025. CLEF 2025: Conference and Labs of the Evaluation Forum 9 September 2025 - 12 September 2025 no. 37. Spain: CLEF. 1-14.
- Staiano M.C., Han L., Monti J. & Chiusaroli F. (2025), ITALERT: Assessing the Quality of LLMs and NMT in Translating Italian Emergency Response Text. Bouillon P., Gerlach J., Girletti S., Volkart L., Rubino R., Sennrich R., Farinha A.C., Gaido M., Daems J., Kenny D., Moniz H. & Szoc S. (Eds.), Proceedings of Machine Translation Summit XX, 23 June 2025 - 27 June 2025 no. 1: European Association for Machine Translation. 566-577.
- Romero P., Ren L., Han L. & Nenadic G. (2025), The Manchester Bees at PerAnsSumm 2025: Iterative Self-Prompting with Claude and o1 for Perspective-aware Healthcare Answer Summarisation. Ananiadou S., Demner-Fushman D., Gupta D. & Thompson P. (Eds.), Proceedings of the Second Workshop on Patient-Oriented Language Processing (CL4Health). Second Workshop on Patient-Oriented Language Processing (CL4Health) 4 May 2025 - 4 May 2025. New York, U.S.A.: Association for Computational Linguistics. 340-348.
- Belkadi S., Micheletti N., Han L., Del-Pinto W. & Nenadic G. (2025), LT3: Generating Medication Prescriptions with Conditional Transformer. Ananiadou S., Demner-Fushman D., Gupta D. & Thompson P. (Eds.), Proceedings of the Second Workshop on Patient-Oriented Language Processing (CL4Health). Second Workshop on Patient-Oriented Language Processing (CL4Health) 4 May 2025 - 4 May 2025. New York, U.S.A.: Association for Computational Linguistics. 205-218.
- Han L., Mohamed N. H., Rassem M., Jones G. J.F., Smeaton A. F. & Nenadic G. (2025), Towards a resource for multilingual lexicons: an MT assisted and human-in-the-loop multilingual parallel corpus with multi-word expression annotation, Language Resources and Evaluation Journal.
- Vink A.J.W. de., Amat-Lefort N. & Han L. (2025), ReviewGraph: A Knowledge Graph Embedding Based Framework for Review Rating Prediction with Sentiment Features. The 16th IEEE International Conference on Knowledge Graphs (ICKG 2025) 13 November 2025 - 14 November 2025.
- Alabdullah A., Han L. & Lin C. (2025), Advancing Dialectal Arabic to Modern Standard Arabic Machine Translation. [working paper].
- Ling Z., Li Z., Romero P., Han L. & Nenadic G. (2025), MaLei at the PLABA Track of TREC 2024: RoBERTa for Term Replacement--LLaMA3.1 and GPT-4o for Complete Abstract Adaptation. NIST SP1329 The Thirty-Third Text REtrieval Conference (TREC 2024) 18 November 2024 - 22 November 2024 1-13.
- Staiano M. C., Han L., Monti J. & Chiusaroli F. (2025), Towards a reliable annotation framework for crisis MT evaluation: Addressing error taxonomies and annotator agreement, CL2025 Book of Abstracts. Corpus Linguistics 2025 30 June 2025 - 3 July 2025. Aston University, Birmingham City University and the University of Birmingham 212-212.
- Han L., Jones GJF. & Smeaton AF (2025), An Empirical Study on Chinese Character Decomposition in Multiword Expression-Aware Neural Machine Translation. [working paper].