Skip to content
View gsajko's full-sized avatar

Block or report gsajko

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Let's all work together on a 3d scanner benchmark for desktop 3d scanners

116 1 Updated Mar 6, 2025

Classroom-ready open-source educational exoskeleton for biomedical and control engineering

TeX 5 4 Updated Jan 30, 2025

A repository for research on medium sized language models.

Python 516 73 Updated Jun 6, 2025

Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).

Python 7,094 496 Updated Oct 30, 2025

Twitter Scraper

Python 627 88 Updated Jul 15, 2025

Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI

Python 221 23 Updated Apr 29, 2024
Python 196 11 Updated May 5, 2024

auto fine tune of models with synthetic data

Python 75 4 Updated Feb 14, 2024

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

66,428 7,487 Updated Jun 4, 2025

Minimalistic large language model 3D-parallelism training

Python 2,283 251 Updated Sep 3, 2025

structured outputs for llms

Python 11,737 878 Updated Oct 31, 2025

LUI: Autonomous Collective Decision Making via Large Language Models

Python 105 6 Updated Apr 23, 2023

Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, Du…

Rust 5,626 461 Updated Oct 31, 2025
Jupyter Notebook 339 54 Updated Jun 26, 2023

Data Engineering Zoomcamp is a free 9-week course on building production-ready data pipelines. The next cohort starts in January 2026. Join the course here 👇🏼

Jupyter Notebook 33,293 7,073 Updated Oct 15, 2025

Scripts to create a basic search on podcast data in general

Python 10 1 Updated Dec 23, 2022

An end-to-end implementation of intent prediction with Metaflow and other cool tools

Python 872 66 Updated Jun 16, 2023

Code for Cicero, an AI agent that plays the game of Diplomacy with open-domain natural language negotiation.

Python 1,392 167 Updated Apr 17, 2025

Next.js app for serverless deployments of OpenAI Whisper on Banana.dev

JavaScript 96 34 Updated Sep 22, 2022

AI-powered CLI tool to help you remember bash commands.

Rust 333 16 Updated Jul 6, 2024

Construct a modern data stack and orchestration the workflows to create high quality data for analytics and ML applications.

Jupyter Notebook 228 37 Updated Sep 12, 2022

An Obsidian.md plugin to save tweets as Markdown files.

TypeScript 209 13 Updated May 8, 2023

Supporting materials/code examples for my course in data engineering for machine learning.

Python 38 6 Updated Nov 15, 2022

Resumes generated using the GitHub informations

JavaScript 62,675 1,361 Updated Feb 15, 2023

An underground, wireless, open-source, low-cost system for monitoring oxygen, temperature, and soil moisture

C++ 7 Updated Nov 19, 2021

Free MLOps course from DataTalks.Club

Jupyter Notebook 13,564 2,713 Updated Oct 15, 2025

Building a real-time twitter graph of your friends

C# 264 14 Updated May 15, 2022

CodeGen is a family of open-source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.

Python 5,145 415 Updated Oct 27, 2025

Repo for Ecosystem Creator project based on Synthetic Silviculture Paper

C++ 4 Updated Nov 2, 2021

a cheat-sheet for mathematical notation in code form

15,437 1,100 Updated Mar 8, 2022
Next