Date of Award:

5-2009

Document Type:

Dissertation

Degree Name:

Doctor of Philosophy (PhD)

Department:

Computer Science

Committee Chair(s)

Changhui Yan

Committee

Changhui Yan

Committee

Donald H. Cooley

Committee

Heng-Da Cheng

Committee

Xiaojun Qi

Committee

John R. Stevens

Abstract

High-throughput genomics projects have resulted in a rapid accumulation of protein sequences. Therefore, computational methods that can predict protein functions and functional sites efficiently and accurately are in high demand. In addition, prediction methods utilizing only sequence information are of particular interest because for most proteins, 3-dimensional structures are not available. However, there are several key challenges in developing methods for predicting protein function and functional sites. These challenges include the following: the construction of representative datasets to train and evaluate the method, the collection of features related to the protein functions, the selection of the most useful features, and the integration of selected features into suitable computational models. In this proposed study, we tackle these challenges by developing procedures for benchmark dataset construction and protein feature extraction, implementing efficient feature selection strategies, and developing effective machine learning algorithms for protein function and functional site predictions. We investigate these challenges in three bioinformatics tasks: the discovery of transmembrane beta-barrel (TMB) proteins in gram-negative bacterial proteomes, the identification of deleterious non-synonymous single nucleotide polymorphisms (nsSNPs), and the identification of helix-turn-helix (HTH) motifs from protein sequence.

Checksum

fed52c060b54cc6936c14ca6c44ff9dd

Recommended Citation

Hu, Jing, "Prediction of Protein Function and Functional Sites From Protein Sequences" (2009). All Graduate Theses and Dissertations, Spring 1920 to Summer 2023. 292.
https://digitalcommons.usu.edu/etd/292

Download

Included in

Biology Commons

COinS

Copyright for this work is retained by the student. If you have any questions regarding the inclusion of this work in the Digital Commons, please email us at .

DOI

https://doi.org/10.26076/06b1-92d8

All Graduate Theses and Dissertations, Spring 1920 to Summer 2023

Prediction of Protein Function and Functional Sites From Protein Sequences

Date of Award:

Document Type:

Degree Name:

Department:

Committee Chair(s)

Committee

Committee

Committee

Committee

Committee

Abstract

Checksum

Recommended Citation

Included in

DOI

Browse

For Authors

Scholarly Communication

Research Data

All Graduate Theses and Dissertations, Spring 1920 to Summer 2023

Prediction of Protein Function and Functional Sites From Protein Sequences

Author

Date of Award:

Document Type:

Degree Name:

Department:

Committee Chair(s)

Committee

Committee

Committee

Committee

Committee

Abstract

Checksum

Recommended Citation

Included in

Share

DOI

Browse

For Authors

Scholarly Communication

Research Data