Bench360 is a modular benchmarking suite for local LLM deployments. It offers a full-stack, extensible pipeline to evaluate the latency, throughput, quality, and cost of LLM inference on consumer a…

Python 4 3 Updated Sep 24, 2025

MediaWiki scraper: all your wiki articles in one highly compressed ZIM file

TypeScript 410 93 Updated Nov 25, 2025

Generate high-definition short videos with one click using large AI models.

Python 47,955 6,725 Updated Jun 11, 2025

Python API to access and manipulate ELOG.

Python 22 14 Updated Sep 15, 2025

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

Svelte 116,379 16,326 Updated Nov 27, 2025
Python 24 1 Updated Aug 1, 2024

A curated list for Efficient Large Language Models

Python 1,903 146 Updated Jun 17, 2025

Enhancing Medical Question-Answering System through Advanced Information Retrieval Strategies and Integration of GPT-3.5

Jupyter Notebook 24 7 Updated Sep 25, 2025

MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.

Python 2,079 187 Updated Jun 30, 2025
Jupyter Notebook 11 1 Updated Jun 26, 2025

Comparison of Language Model Inference Engines

235 9 Updated Dec 16, 2024

Evaluating the Effectiveness of Transformer Layers in Wav2Vec 2.0, XLS-R, and Whisper for Speaker Identification Tasks

Jupyter Notebook 3 1 Updated Sep 2, 2025

📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉

Python 4,757 324 Updated Nov 11, 2025

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 17,623 2,235 Updated Feb 1, 2025

Code for MedCPT, a model for zero-shot biomedical information retrieval.

Python 218 21 Updated Mar 24, 2024

[ACL 2022] LinkBERT: A Knowledgeable Language Model 😎 Pretrained with Document Links

Python 448 43 Updated Apr 5, 2022

A full spaCy pipeline and models for scientific/biomedical documents.

Python 1,900 249 Updated Nov 17, 2025

Official repository of the MIRAGE benchmark

Python 182 22 Updated Nov 3, 2024

Meditron is a suite of open-source medical Large Language Models (LLMs).

Python 2,115 205 Updated Apr 10, 2024

Official implementation of the NeurIPS 2024 paper "G-Retriever: Retrieval-Augmented Generation for Textual Graph Understanding and Question Answering"

Python 505 90 Updated Mar 19, 2025