A Python-based tool for analyzing DNA sequences from FASTA files. This tool provides various functionalities for DNA sequence analysis including base counting, complement sequence generation, RNA transcription, GC content calculation, and motif finding.
- Read and parse FASTA files
- Count nucleotide bases (A, C, G, T)
- Generate complementary DNA sequences
- Generate reverse complementary sequences
- Transcribe DNA to RNA
- Calculate GC content
- Find specific motifs in the sequence
- Python 3.x
- BioPython library
- Clone this repository:
git clone https://github.com/yourusername/DNA_Analysis_Tool.git
cd DNA_Analysis_Tool- Create and activate a virtual environment (recommended):
python -m venv venv
source venv/bin/activate # On Windows, use: venv\Scripts\activate- Install required packages:
pip install biopython- Place your FASTA file in the
datadirectory - Run the script:
python dna_analysis.pyThe script will analyze the DNA sequence and display:
- Sequence length
- Base counts
- Complementary sequence (first 60 bases)
- Reverse complementary sequence (first 60 bases)
- RNA transcription (first 60 bases)
- GC content
- Positions of the specified motif (default: "ATG")
DNA Sequence Length: [length]
Base Count:
A: [count]
C: [count]
G: [count]
T: [count]
Complementary Sequence (first 60 bases):
[sequence]...
Reverse Complementary Sequence (first 60 bases):
[sequence]...
RNA Sequence (first 60 bases):
[sequence]...
GC Content: [percentage]%
Motif 'ATG' found at positions:
[positions]
Contributions are welcome! Please feel free to submit a Pull Request.
This project is licensed under the MIT License - see the LICENSE file for details.
- BioPython library for FASTA file handling
- The scientific community for DNA analysis methodologies