mini_parallel

mini core bioinformatics algorithms- Smith-Waterman, k-mer, and variant calling (with DeepVariant), all run on a 50GB WGS from Nucleus.

Smith-Waterman

DNA sequence alignment using SIMD instructions. Compares two DNA sequences and scores how well the letters line up. Match gets +2 and Mismatch gets -1.

A = A (+2)
T = T (+2)
C = T (-1)
G = G (+2)
T = G (-1)
...

For my WGS:

Direct alignment: compare to average reference genome
Complementary alignment: find what % of genome is not perfectly complementary (boooo)

Setup for WGS Processing

Environment Configuration

Create a .env file in the project root with your WGS data configuration:

# WGS Data Configuration
WGS_DATA_DIR=/path/to/your/wgs/data
WGS_SAMPLE_ID=your-sample-id
WGS_LANES=8
WGS_READS_PER_LANE=2

# GPU Configuration
GPU_CHUNK_SIZE_READS=10000
GPU_CHUNK_SIZE_BASES=1000000

Usage

# Test WGS file reading
cargo run -- --test-wgs --gpu

# Process full WGS dataset
cargo run -- --full-wgs --gpu

# Run with Nsight Systems
nsys profile -t opencl,cuda,osrt --output wgs_profile ./target/release/rustseq_mini --full-wgs --gpu

File Naming Convention

The aligner expects files named: {SAMPLE_ID}_L{LANE:03}_R{READ}_001.fastq.gz

Example: SAMPLE_001_L001_R1_001.fastq.gz

Name		Name	Last commit message	Last commit date
Latest commit History 89 Commits
k_mer		k_mer
smith_waterman		smith_waterman
variant_calling		variant_calling
.gitignore		.gitignore
README.md		README.md
improvements.txt		improvements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

mini_parallel

Smith-Waterman

For my WGS:

Setup for WGS Processing

Environment Configuration

Usage

File Naming Convention

About

Uh oh!

Releases

Packages

Languages

bmwoolf/mini_parallel

Folders and files

Latest commit

History

Repository files navigation

mini_parallel

Smith-Waterman

For my WGS:

Setup for WGS Processing

Environment Configuration

Usage

File Naming Convention

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages