Ioannis Iliopoulos, Sophia Tsoka, et al.
Bioinformatics
We investigate the performance of combinatorial pattern discovery to detect remote sequence similarities in terms of both biological accuracy and computational efficiency for a pair of distantly related families, as a case study. The two families represent the cupredoxins and multicopper oxidases, both containing blue copper-binding domains. These families present a challenging case due to low sequence similarity, different local structure, and variable sequence conservation at their copper-binding active sites. In this study, we investigate a new approach for automatically identifying weak sequence similarities that is based on combinatorial pattern discovery. We compare its performance with a traditional, HMM-based scheme and obtain estimates for sensitivity and specificity of the two approaches. Our analysis suggests that pattern discovery methods can be substantially more sensitive in detecting remote protein relationships while at the same time guaranteeing high specificity. © 2005 Wiley-Liss, Inc.
Ioannis Iliopoulos, Sophia Tsoka, et al.
Bioinformatics
Filippo Utro, Omar E. Cornejo, et al.
BMC Bioinformatics
Dimitrios Christofidellis, Giorgio Giannone, et al.
MRS Spring Meeting 2023
Eran Eden, Doron Lipson, et al.
PLoS Computational Biology