Data Mining for Bioinformatics Applications by He Zengyou

By He Zengyou

Data Mining for Bioinformatics Applications presents precious details at the facts mining tools were regular for fixing actual bioinformatics difficulties, together with challenge definition, facts assortment, facts preprocessing, modeling, and validation.

The textual content makes use of an example-based solution to illustrate tips on how to follow info mining ideas to resolve genuine bioinformatics difficulties, containing forty five bioinformatics difficulties which were investigated in fresh study. for every instance, the complete info mining approach is defined, starting from info preprocessing to modeling and end result validation.

  • Provides worthy details at the info mining equipment were regular for fixing genuine bioinformatics problems
  • Uses an example-based technique to illustrate tips on how to practice info mining thoughts to unravel actual bioinformatics problems
  • Contains forty five bioinformatics difficulties which were investigated in fresh research

Show description

Read or Download Data Mining for Bioinformatics Applications PDF

Similar data modeling & design books

Medical Imaging and Augmented Reality Second International Workshop

This scholarly set of well-harmonized volumes presents essential and entire assurance of the intriguing and evolving topic of clinical imaging structures. major specialists at the foreign scene take on the most recent state of the art ideas and applied sciences in an in-depth yet eminently transparent and readable technique.


Metaheuristics convey fascinating homes like simplicity, effortless parallelizability, and prepared applicability to varieties of optimization difficulties. After a accomplished advent to the sector, the contributed chapters during this ebook contain reasons of the most metaheuristics concepts, together with simulated annealing, tabu seek, evolutionary algorithms, man made ants, and particle swarms, by means of chapters that display their functions to difficulties akin to multiobjective optimization, logistics, motor vehicle routing, and air site visitors administration.

Additional info for Data Mining for Bioinformatics Applications

Example text

For example, the entire PPI networks for most species are still not established, making it difficult to accurately evaluate the performance of PPI network inference algorithms. Therefore, more research should be devoted to the development of effective validation algorithms for assessing the data mining results in bioinformatics applications. 5 Summary Bioinformatics is a field that is still advancing rapidly, making it impossible to cover all the contents of bioinformatics even within a book. In this chapter, some bioinformatics and related data analysis tasks are introduced.

24 Data Mining for Bioinformatics Applications Because the peptide that contains one target motif must also contain its submotifs, the set of peptides that contain this motif must be a subset of the collection of peptides that contain its submotif. In particular, there are exactly k submotifs of size k À 1 for one k-motif. For each submotif of size k À 1, we can generate a set of peptides in which every peptide contains this submotif. P” is a 3-motif and has three submotifs of size two. Each submotif will induce a new foreground data set and a new background data set by selecting peptides that contain the corresponding submotif.

3 Modeling: Different learning schemes In Ref. [2], existing phosphorylation site prediction tools are summarized and categorized from different viewpoints. These techniques differ in several ways: the machine learning or data mining techniques used; the features extracted from the set of peptides; whether predictions are kinase-specific or non-kinase-specific; and the construction of training and testing data. Here we discuss the key modeling part from a different angle: the taxonomy of underlying machine learning principles.

Download PDF sample

Rated 4.42 of 5 – based on 22 votes

About the Author