Training Large Language Encoders with the Curated Carolina Corpus. Guilherme Lamartine Mello, Paulo Rodrigo Cavalin, et al. 2024. PROPOR 2024.
From disjoint sets to parallel data to train seq2seq models for sentiment transfer. Paulo Cavalin, Marisa Vasconcelos, et al. 2020. EMNLP 2020.
Using distributed representations for semantic similarity and entailment recognition. Luciano Barbosa, Paulo Cavalin, et al. 2016. Linguamatica.
A scalable architecture for real-time analysis of microblogging data. Paulo Cavalin, Maíra Gatti, et al. 2015. IBM J. Res. Dev.