selected publications

You can also checkout my google scholar page.

(*- equal contribution)

preprints

Yixuan Su, Fangyu Liu, Zaiqiao Meng, Lei Shu, Ehsan Shareghi, Nigel Collier.
TaCL: Improving BERT Pre-training with Token-aware Contrastive Learning.
arxiv preprint, November 2021.
[arxiv] [code] [huggingface models]

Wenxuan Zhou*, Fangyu Liu*, Ivan Vulić, Nigel Collier, Muhao Chen.
Prix-LM: Pretraining for Multilingual Knowledge Base Construction.
arxiv preprint, October 2021.
[arxiv]

Zaiqiao Meng*, Fangyu Liu*, Ehsan Shareghi, Yixuan Su, Charlotte Collins, Nigel Collier.
Rewire-then-Probe: A Contrastive Recipe for Probing Biomedical Knowledge of Pre-trained Language Models.
arxiv preprint, October 2021.
[arxiv]

2021

Fangyu Liu, Yunlong Jiao, Jordan Massiah, Emine Yilmaz, Serhii Havrylov.
Trans-Encoder: Unsupervised sentence-pair modelling through self- and mutual-distillations.
NeurIPS 2021 Workshop: Self-Supervised Learning - Theory and Practice, December 2021.
[arxiv] [code] [huggingface models]

Qianchu Liu*, Fangyu Liu*, Nigel Collier, Anna Korhonen, Ivan Vulić.
MirrorWiC: On Eliciting Word-in-Context Representations from Pretrained Language Models.
The 25th Conference on Computational Natural Language Learning (CoNLL 2021), November 2021.
[ACL Anthology] [arxiv] [code]

Fangyu Liu, Ivan Vulić, Anna Korhonen, Nigel Collier.
Fast, Effective, and Self-Supervised: Transforming Masked Language Models into Universal Lexical and Sentence Encoders.
The 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021), November 2021.
[ACL Anthology] [arxiv] [code] [huggingface models]

Fangyu Liu*, Emanuele Bugliarello*, Edoardo Maria Ponti, Siva Reddy, Nigel Collier, Desmond Elliott.
Visually Grounded Reasoning across Languages and Cultures.
The 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021), November 2021.
Best Long Paper Award
[ACL Anthology] [arxiv] [website]

Wenxuan Zhou, Fangyu Liu, Muhao Chen.
Contrastive Out-of-Distribution Detection for Pretrained Transformers.
The 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021), November 2021.
[ACL Anthology] [arxiv] [code]

Zaiqiao Meng, Fangyu Liu, Thomas Clark, Ehsan Shareghi, Nigel Collier.
Mixture-of-Partitions: Infusing Large Biomedical Knowledge Graphs into BERT.
The 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021), November 2021.
[ACL Anthology] [arxiv] [code]

Fangyu Liu, Ivan Vulić, Anna Korhonen, Nigel Collier.
Learning Domain-Specialised Representations for Cross-Lingual Biomedical Entity Linking.
The Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP 2021), August 2021.
[ACL Anthology] [arxiv] [code] [huggingface models]

Fangyu Liu, Ehsan Shareghi, Zaiqiao Meng, Marco Basaldella, Nigel Collier.
Self-Alignment Pretraining for Biomedical Entity Representations.
2021 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2021), June 2021.
also presented at NeurIPS 2020 Workshop: Self-Supervised Learning - Theory and Practice, December 2020.
[ACL Anthology] [arxiv] [code] [huggingface models] [slides] [poster] [bibtex]
(check out an implementation and tutorial by NVIDIA’s NeMo team)

Fangyu Liu, Muhao Chen, Dan Roth, Nigel Collier.
Visual Pivoting for (Unsupervised) Entity Alignment.
The 35th AAAI Conference on Artificial Intelligence (AAAI 2021), February 2021.
[AAAI Archives] [arxiv] [code] [slides]

2020

Marco Basaldella*, Fangyu Liu*, Ehsan Shareghi, Nigel Collier.
COMETA: A Corpus for Medical Entity Linking in the Social Media.
The 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020), November 2020.
[ACL Anthology] [arxiv] [website] [code] [huggingface-bioreddit-bert] [bibtex]

Fangyu Liu, Rémi Lebret, Didier Orel, Philippe Sordet, Karl Aberer.
Upgrading the Newsroom: An Automated Image Selection System for News Articles.
ACM Transactions on Multimedia Computing Communications and Applications (ACM TOMM), Vol. 16, Issue 3, July 2020.
[ACM Digital Library] [arxiv] [slides] [demo] [bibtex]

Fangyu Liu*, Rongtian Ye*, Xun Wang*, Shuaipeng Li.
HAL: Improved Text-Image Matching by Mitigating Visual Semantic Hubs.
The 34th AAAI Conference on Artificial Intelligence (AAAI 2020), February 2020.
[AAAI Archives] [arxiv] [code] [poster] [bibtex]

2019 and before

Fangyu Liu, Rongtian Ye.
A Strong and Robust Baseline for Text-Image Matching.
ACL 2019 Student Research Workshop (ACL 2019 SRW), August 2019.
[ACL Anthology] [arxiv] [bibtex]

Fangyu Liu, Rémi Lebret, Karl Aberer.
Visually Grounded Cross-Lingual Transfer Learning.
NAACL 2019 Workshop on Shortcomings in Vision and Language, June 2019.
[PDF] [poster]

C.-H. Huck Yang, Jia-Hong Huang, Fangyu Liu, Fang-Yi Chiu, Mengya Gao, Weifeng Lyu, I-Hung Lin, Jesper Tegner.
A Novel Hybrid Machine Learning Model for Auto-Classification of Retinal Diseases.
ICML 2018 Workshop on Computational Biology, July 2018.
[arxiv]

Fangyu Liu*, Shuaipeng Li*, Liqiang Zhang, Chenghu Zhou, Rongtian Ye, Yuebin Wang, Jiwen Lu.
3DCNN-DQN-RNN: A Deep Reinforcement Learning Framework for Semantic Parsing of Large-scale 3D Point Clouds.
The 2017 IEEE International Conference on Computer Vision (ICCV 2017), October 2017.
[CVF openaccess] [poster] [slides] [bibtex]