Publication:
A systematic bioinformatics approach for large-scale identification and characterization of host-pathogen shared sequences.

dc.contributor.authorJames, Stephen Among
dc.contributor.authorOng, Hui San
dc.contributor.authorHari, Ranjeev
dc.contributor.authorKhan, Asif M
dc.date.accessioned2023-05-16T15:20:04Z
dc.date.available2023-05-16T15:20:04Z
dc.date.issued2021-09-27T21:00:00Z
dc.description.abstractBiology has entered the era of big data with the advent of high-throughput omics technologies. Biological databases provide public access to petabytes of data and information facilitating knowledge discovery. Over the years, sequence data of pathogens has seen a large increase in the number of records, given the relatively small genome size and their important role as infectious and symbiotic agents. Humans are host to numerous pathogenic diseases, such as that by viruses, many of which are responsible for high mortality and morbidity. The interaction between pathogens and humans over the evolutionary history has resulted in sharing of sequences, with important biological and evolutionary implications.
dc.description.abstractThis study describes a large-scale, systematic bioinformatics approach for identification and characterization of shared sequences between the host and pathogen. An application of the approach is demonstrated through identification and characterization of the Flaviviridae-human share-ome. A total of 2430 nonamers represented the Flaviviridae-human share-ome with 100% identity. Although the share-ome represented a small fraction of the repertoire of Flaviviridae (~ 0.12%) and human (~ 0.013%) non-redundant nonamers, the 2430 shared nonamers mapped to 16,946 Flaviviridae and 7506 human non-redundant protein sequences. The shared nonamer sequences mapped to 125 species of Flaviviridae, including several with unclassified genus. The majority (~ 68%) of the shared sequences mapped to Hepacivirus C species; West Nile, dengue and Zika viruses of the Flavivirus genus accounted for ~ 11%, ~ 7%, and ~ 3%, respectively, of the Flaviviridae protein sequences (16,946) mapped by the share-ome. Further characterization of the share-ome provided important structural-functional insights to Flaviviridae-human interactions.
dc.description.abstractMapping of the host-pathogen share-ome has important implications for the design of vaccines and drugs, diagnostics, disease surveillance and the discovery of unknown, potential host-pathogen interactions. The generic workflow presented herein is potentially applicable to a variety of pathogens, such as of viral, bacterial or parasitic origin.
dc.identifier.pubmed34583643
dc.identifier.urihttps://hdl.handle.net/20.500.12645/37948
dc.language.isoen
dc.subjectBioinformatics
dc.subjectCross-reactivity
dc.subjectCrossreactome
dc.subjectDengue virus
dc.subjectFlaviviridae
dc.subjectFlavivirus
dc.subjectHepacivirus
dc.subjectHepatitis C virus
dc.subjectHost-pathogen
dc.subjectLarge-scale
dc.subjectMethodology
dc.subjectPegivirus
dc.subjectPeptide overlap
dc.subjectPeptide sharing
dc.subjectPestivirus
dc.subjectShare-ome
dc.subjectShared sequences
dc.subjectWest Nile virus
dc.subjectand Molecular mimicry.
dc.titleA systematic bioinformatics approach for large-scale identification and characterization of host-pathogen shared sequences.
dspace.entity.typePublication
local.indexed.atPubMed

Files