Abstract:
|
Proteins represent the most important groups of biomoleculs. Di erent
functions that they carry out in each organism are unique and irreplaceable,
including versatile cellular processes, structural role of proteins, catalytic
function, a number of metabolic functions and so on. Knowing and under-
standing protein function is therefore essential in investigation of any biolo-
gical process, especially of human diseases since a lot of them are caused by
functional mutations.
In this paper, we represent investigation of protein function domain through
two di erent approaches. In the rst one, protein function is represented
by GO ontologies with the structure of a directed acyclic graph. There are
three GO ontologies: one for functions regarding biological processes, one
for functions regarding cellular components and one for molecular functions.
Each ontology contains several thousands of nodes, where every node deter-
mines more speci c function than his ascendants. The task of this part of
research was to develop a software for predicting protein function from its
primary sequence based on structural support vector machines method which
represents generalization of well-known support vector machines method on
structural output.
Structure-function paradigm is one of basic concepts in molecular biology,
stating that 3D proten structure is closely connected to its role in organism.
It has been detected that disordered proteins (the ones that lack 3D struc-
ture) and disordered regions of proteins are related with severe contemporary
illnesses, which contributed to their popularity in modern research. In an-
other aspect, we investigated the relationship between proteins' functional
categories and their disorder, as well ad with other physico-chemical char-
acteristics of proteins. Here, protein function has been observed through
25 elementary functions grouped in 4 functional groups. In this work, we
present results of thorough analysis over large protein dataset where dis-
order has been determined computationally, using publicly available tools. |