The purpose of this project is to predict active and inactive regulatory regions using machine learning. To train the models, different types of data are used: epigenomic data and sequence data. The epigenomic data represents the intersection between the DNA and proteins, while the sequence data are simply chromosomal sequences.
The genome data are preprocessed in order to increase the models' performances. The learning algorithms tested are decision tree, random forest, perceptron, multi-layer perceptron, feed-forward neural network and convolutional neural network.
A detailed report is contained in latex
folder.
-
Notifications
You must be signed in to change notification settings - Fork 0
micheleantonazzi/bioinformatics-project
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
Analyze the active regulatory region of DNA using FFNN and CNN
Topics
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published