Skip to content
View MisgaXiong's full-sized avatar
  • The Chinese University of Hong Kong
  • Hong Kong SAR, China

Block or report MisgaXiong

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

LLMs for high-throughput mining and generation of antimicrobial peptides

Python 19 5 Updated Oct 10, 2025

Genome modeling and design across all domains of life

Jupyter Notebook 3,199 368 Updated Sep 17, 2025

Deezer source separation library including pretrained models.

Python 27,766 3,048 Updated Apr 2, 2025

DNABERT: pre-trained Bidirectional Encoder Representations from Transformers model for DNA-language in genome

Python 716 176 Updated Jul 8, 2025

An R package to calculate indices and theoretical physicochemical properties of peptides and protein sequences.

R 93 24 Updated Jan 23, 2024
Python 3 3 Updated Feb 15, 2025

《动手学大模型Dive into LLMs》系列编程实践教程

Jupyter Notebook 9,747 980 Updated Oct 10, 2025
Python 902 70 Updated Oct 24, 2025

[ICLR 2024] DNABERT-2: Efficient Foundation Model and Benchmark for Multi-Species Genome

Shell 435 89 Updated Aug 14, 2025

User friendly and accurate binder design pipeline

Python 908 209 Updated Aug 12, 2025

Source code of ProDMM. The paper is titled with "Unveiling Protein-DNA Interdependency: Harnessing Unified Multimodal Sequence Modeling, Understanding and Generation".

Python 10 1 Updated Nov 21, 2024
Jupyter Notebook 560 106 Updated Apr 3, 2025

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 152,635 31,157 Updated Nov 17, 2025

UniRep model, usage, and examples.

Python 358 97 Updated Jun 21, 2022

Tasks Assessing Protein Embeddings (TAPE), a set of five biologically relevant semi-supervised learning tasks spread across different domains of protein biology.

Python 721 133 Updated Dec 11, 2022

The second version of the Kraken taxonomic sequence classification system

C++ 846 293 Updated Nov 7, 2025

tools for working with Bisulfite Sequencing data while preserving reads intrinsic dependencies

Python 172 50 Updated Nov 8, 2025
Python 49 17 Updated Jan 4, 2023

Code repository accompanying the manuscript, Symbiont loss and gain, rather than co-diversification shapes honeybee gut microbiota diversity and function

Python 10 1 Updated Feb 21, 2025
Python 4 Updated Oct 21, 2025

Visualize outputs of AmpliconArchitect and AmpliconReconstructor in Circos-style images.

Python 28 6 Updated Oct 10, 2025

The complete sequence of a human genome

998 100 Updated Jul 14, 2025

Kuhlman Lab Installation of AlphaFold3

Python 37 5 Updated Sep 30, 2025

A topic-centric list of HQ open datasets.

70,552 10,912 Updated Nov 5, 2025
Python 3 Updated Jul 15, 2025

A reproducible Snakemake pipeline for end-to-end cell-free DNA (cfDNA) fragmentomics analysis (WPS, TSS, CTCF, motifs,MDS)).

Python 1 Updated Jul 27, 2025

A Snakemake pipeline for processing and analysis of cell-free RNA (cfRNA) sequencing data.

Python 2 Updated Jul 25, 2025

Scripts for cfRNA analysis

Python 2 8 Updated Apr 29, 2021

An community curated awesome list of tools, software, databases and other resources for working/analysing RNA Viruses

HTML 20 1 Updated Aug 15, 2025
Next