Welcome, I’m Alvin!

I build AI for healthcare and biomedical discoveries. Before my assistant professorship, I was a postdoctoral fellow in the Traverso Lab at MIT, where I focused on deep learning research to make nanomedicine more accessible, more precise, and more effective. If you’re driven by research that makes a difference and are excited about the intersection of AI and medicine, we should talk!

I am seeking PhD students with a passion for applying deep learning to medicine.

Interests

Generative Models
Computational Biology
Nanomedicine

Education

Postdoctoral Fellowship, 2024
Massachusetts Institute of Technology & Brigham and Women's Hospital, Harvard Medical School, USA
PhD in Computer Science, 2021
Nanyang Technological University, Singapore
BEng in Bioengineering, 2013
Nanyang Technological University, Singapore

Research

Deep Learning for Precision Medicine

The convergence of medicine and deep learning promises life-changing innovations in healthcare. To contribute to this cause, my research aims to synthesize insights from multiple domains into a unified AI platform. Whereas existing AI models typically focus on a single modality, I develop deep learning technologies that combine knowledge and modalities from many medical domains, such as multi-omics, nanomedicine, and nucleic acid/protein engineering. Integrating these domains will be key to enhancing our understanding of human health and medicine, leading to groundbreaking discoveries in personalized healthcare.

Developing Intelligent Nanomedicine with AI and High-Throughput Science

My research focuses on the synergy between deep learning and high-throughput science in developing intelligent nanomedicine. Nanomedicine, a specialized field of medicine built on nanotechnology, uses nanoparticles for disease prevention, diagnosis, and treatment. Experimentally screening all possible nanomedicine formulations is immensely challenging because of the vast number of variables involved. With novel deep learning models trained on data generated by high-throughput techniques, we can identify promising formulations in silico more quickly and efficiently. By accelerating the nanomedicine development cycle, this research can help make medicine more accessible, safer, and more effective.

Featured Publications

Alvin Chan*, Ali Madani*, Ben Krause, Nikhil Naik

December 2021 NeurIPS 2021 (Thirty-Fifth Conference on Neural Information Processing Systems)

Deep Extrapolation for Attribute-Enhanced Generation

TL;DR: How do we generate sequences that extrapolate beyond the training distribution?
Abstract: Attribute extrapolation in sample generation is challenging for deep neural networks operating beyond the training distribution. We formulate a new task for extrapolation in sequence generation, focusing on natural language and proteins, and propose GENhance, a generative framework that enhances attributes through a learned latent space. Trained on movie reviews and a computed protein stability dataset, GENhance can generate strongly positive text reviews and highly stable protein sequences without being exposed to similar data during training.

Alvin Chan*, Yew-Soon Ong, Bill Pung, Aston Zhang, Jie Fu

January 2021 ICLR 2021 (International Conference on Learning Representations)

CoCon: A Self-Supervised Approach for Controlled Text Generation

TL;DR: We propose CoCon to control the content of text generation from LMs by conditioning on content inputs at an interleave layer.
Abstract: Pretrained Transformer-based language models (LMs) display remarkable natural language generation capabilities. Given their immense potential, controlling the text generation of such LMs is attracting attention. While there are studies that seek to control high-level attributes (such as sentiment and topic) of generated text, there is still a lack of more precise control over its content at the word and phrase level. Here, we propose Content-Conditioner (CoCon) to control an LM’s output text with a content input at a fine-grained level. In our self-supervised approach, the CoCon block learns to help the LM complete a partially observed text sequence by conditioning on content inputs that are withheld from the LM. Through experiments, we show that CoCon can naturally incorporate target content into generated texts and control high-level text attributes in a zero-shot manner.

Aston Zhang, Yi Tay, Shuai Zhang, Alvin Chan*, Anh Tuan Luu, Siu Cheung Hui, Jie Fu

January 2021 ICLR 2021 (International Conference on Learning Representations)

Parameterization of Hypercomplex Multiplications

TL;DR: We propose a new parameterization of hypercomplex multiplications for architectural flexibility and effectiveness.
Abstract: Recent works have demonstrated reasonable success of representation learning in hypercomplex space. Specifically, the Hamilton product (4D hypercomplex multiplication) enables learning effective representations while saving up to 75% of parameters. However, one key caveat is that hypercomplex space only exists at very few predefined dimensions. This restricts the flexibility of models that leverage hypercomplex multiplications. To this end, we propose parameterizing hypercomplex multiplications, allowing models to learn multiplication rules from data regardless of whether such rules are predefined. As a result, our method not only subsumes the Hamilton product, but also learns to operate on any arbitrary nD hypercomplex space, providing more architectural flexibility. Experiments applying the approach to LSTM and Transformer models on natural language inference, machine translation, text style transfer, and subject-verb agreement demonstrate its architectural flexibility and effectiveness.

Alvin Chan*, Anna Korsakova*, Yew-Soon Ong, Fernaldo Richtia Winnerdy, Kah Wai Lim, Anh Tuan Phan

January 2021 ACM CHIL 2021 (Proceedings of the Conference on Health, Inference, and Learning)

RNA Alternative Splicing Prediction with Discrete Compositional Energy Network

TL;DR: We construct an RNA alternative splicing regression dataset (CAPD) and propose DCEN to predict splicing outcomes by modeling mRNA transcript probabilities through its constituent splice junctions’ energy.
Abstract: A single gene can encode different protein versions through a process called alternative splicing. Since proteins play major roles in cellular functions, aberrant splicing profiles can result in a variety of diseases, including cancers. Alternative splicing is determined by the gene’s primary sequence and other regulatory factors such as RNA-binding protein levels. With these as input, we formulate the prediction of RNA splicing as a regression task and build a new training dataset (CAPD) to benchmark learned models. We propose the discrete compositional energy network (DCEN), which leverages the hierarchical relationships between splice sites, junctions, and transcripts to approach this task. In the case of alternative splicing prediction, DCEN models mRNA transcript probabilities through the energy values of their constituent splice junctions. These transcript probabilities are subsequently mapped to relative abundance values of key nucleotides and trained with ground-truth experimental measurements. Through our experiments on CAPD, we show that DCEN outperforms baselines and ablation variants.

Alvin Chan*, Yi Tay, Yew-Soon Ong, Aston Zhang

October 2020 Findings of Empirical Methods in Natural Language Processing 2020

Poison Attacks against Text Datasets with Conditional Adversarially Regularized Autoencoder

TL;DR: We propose the Conditional Adversarially Regularized Autoencoder to embed a poison signature and generate natural-looking poisoned text, demonstrating models’ vulnerability to backdoor poisoning.
Abstract: This paper demonstrates a fatal vulnerability in natural language inference (NLI) and text classification systems. More concretely, we present a ‘backdoor poisoning’ attack on NLP models. Our poisoning attack uses a conditional adversarially regularized autoencoder (CARA) to generate poisoned training samples by poison injection in latent space. Just by adding 1% poisoned data, our experiments show that the predictions of a victim fine-tuned BERT classifier can be steered to the poison target class with success rates of >80% when the input hypothesis is injected with the poison signature, demonstrating that NLI and text classification systems face a huge security risk.

Recent Publications

Aston Zhang, Yi Tay, Yikang Shen, Alvin Chan*, Shuai Zhang (2021). Self-Instantiated Recurrent Units with Dynamic Soft Recursion. NeurIPS 2021 (Thirty-Fifth Conference on Neural Information Processing Systems).

Aston Zhang, Alvin Chan*, Yi Tay, Jie Fu, Shuohang Wang, Shuai Zhang, Huajie Shao, Shuochao Yao, Roy Ka-Wei Lee (2021). On Orthogonality Constraints for Transformers. ACL 2021 (Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, Volume 2: Short Papers).

Wei Long Ng*, Alvin Chan*, Yew-Soon Ong, Chee Kai Chua (2020). Deep learning for fabrication and maturation of 3D bioprinted tissues and organs. Virtual and Physical Prototyping, Volume 15, Issue 3.

Alvin Chan*, Yew-Soon Ong (2019). Poison as a Cure: Detecting & Neutralizing Variable-Sized Backdoor Attacks in Deep Neural Networks. arXiv:1911.08040 [cs].
