Ruiqiang Lu, Jun Wang, et al.
Briefings in Bioinformatics
We investigate the performance of combinatorial pattern discovery to detect remote sequence similarities in terms of both biological accuracy and computational efficiency for a pair of distantly related families, as a case study. The two families represent the cupredoxins and multicopper oxidases, both containing blue copper-binding domains. These families present a challenging case due to low sequence similarity, different local structure, and variable sequence conservation at their copper-binding active sites. In this study, we investigate a new approach for automatically identifying weak sequence similarities that is based on combinatorial pattern discovery. We compare its performance with a traditional, HMM-based scheme and obtain estimates for sensitivity and specificity of the two approaches. Our analysis suggests that pattern discovery methods can be substantially more sensitive in detecting remote protein relationships while at the same time guaranteeing high specificity. © 2005 Wiley-Liss, Inc.
Ruiqiang Lu, Jun Wang, et al.
Briefings in Bioinformatics
John M. Prager, Jennifer J. Liang, et al.
AMIA Joint Summits on Translational Science 2017
Miles A. Whittington, Ian M. Stanford, et al.
Journal of Physiology
Mattia C.F. Prosperi, Andre Altmann, et al.
Antiviral Therapy