Starred repositories
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
A curated list of recent diffusion models for video generation, editing, and various other applications.
✨✨Latest Advances on Multimodal Large Language Models
Reaching LLaMA2 Performance with 0.1M Dollars
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
High-Resolution Image Synthesis with Latent Diffusion Models
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
A latent text-to-image diffusion model
Using Low-rank adaptation to quickly fine-tune diffusion models.
This repository contains the exercises and its solution contained in the book "An Introduction to Statistical Learning" in python.
Graph Attention Networks (https://arxiv.org/abs/1710.10903)
Representation learning on large graphs using stochastic graph convolutions.
Code for the paper "Evolution Strategies as a Scalable Alternative to Reinforcement Learning"
C++ 资源大全中文版,标准库、Web应用框架、人工智能、数据库、图片处理、机器学习、日志、代码分析等。由「开源前哨」和「CPP开发者」微信公号团队维护更新。
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
A treasure chest for visual classification and recognition powered by PaddlePaddle
Simple Training and Deployment of Fast End-to-End Binary Networks
pytorch implementation of "Differentiable Soft Quantization: Bridging Full-Precision and Low-Bit Neural Networks"
刷算法全靠套路,认准 labuladong 就够了!English version supported! Crack LeetCode, not only how, but also why.