Getting this far is great. But we are not done yet. TITAN can’t predict whether a T cell receptor will bind to an epitope the AI hasn’t seen during training. This is disappointing, but hardly surprising.
After all, databases only contain information on a few hundred different epitopes—while there are as many possible epitopes as there are stars in our galaxy. It’s a huge challenge for any model to make predictions for all of them. Neither our model nor any other published model can do that. But we think that our approach of learning from larger, related datasets is a promising first step to overcome this issue.
With TITAN, we have put the first puzzle pieces together. Our long-term goal is to build a reliable, general T cell receptor specificity prediction algorithm—one that could solve the whole puzzle. Such a model would open up possibilities to use T cells as biomarkers, indicating whether a patient has a certain infection, an autoimmune disease, or even cancer. At the same time, it could help researchers design T cells that specifically target cancer cells and make immunotherapies safer.
TITAN is just the first step towards a much larger goal, which could forever change the way we diagnose and treat diseases.
Weber, A., Born, J., Rodriguez Martínez, M. TITAN: T-cell receptor specificity prediction with bimodal attention networks. Bioinformatics. Volume 37, Issue Supplement_1, July 2021, Pages i237–i244. (2021). ↩
Born, J., et al. Data-driven molecular design for discovery and synthesis of novel ligands: a case study on SARS-CoV-2. Mach. Learn.: Sci. Technol. 2 025024. (2021). ↩
Moris, P., De Pauw, J., Postovskaya, A., et al. Current challenges for unseen-epitope TCR interaction prediction and a new perspective derived from image classification. Briefings in Bioinformatics. bbaa318. ( 2020). ↩