Proteomics data loaders for the AnnData format.
This package provides loader functions to import proteomics data (e.g., MaxQuant) into the AnnData structure for downstream analysis and easy integration with single-cell and multi-omics workflows.
- Multiple formats: Support for MaxQuant, FragPipe, DIA-NN, and mzTab files
- Reads metadata: Automatically extracts and organizes sample and protein metadata
Protdata has minimal dependencies listed in pyproject.toml
To install the latest release from PyPI, run:
pip install protdataOr install from source:
git clone https://github.com/czbiohub-sf/protdata.git
cd protdata
pip install -e . # or make setup-develop for developersYou can download an example proteinGroups file here
import protdata
adata = load_maxquant_to_anndata("/path/to/proteinGroups.txt")
print(adata)You can download an example DIA-NN report file here
from protdata.io import read_diann
adata = read_diann("/path/to/report.pg_matrix.tsv")
print(adata)You can download an example FragPipe output file here
from protdata.io import read_fragpipe
adata = read_fragpipe("/path/to/combined_protein.tsv")
print(adata)You can download an example mzTab file here
from protdata.io import read_mztab
adata = read_mztab("/path/to/SILAC_SQ.mzTab")
print(adata)protdata is created and maintained by the Computational Biology Platform at the Chan Zuckerberg Biohub San Francisco.
To get in touch please use the GihHub issues page.
If you want to contribute to protdata, please read the Contribution Guide
See Release Notes