Organizations

@Wikidata @jina-ai @embeddings-benchmark

Model implementation for the contextual embeddings project

Python 39 1 Updated Jun 2, 2025

German dataset for DPR model training

Jupyter Notebook 19 1 Updated Jul 21, 2024

Run Effective Large Batch Contrastive Learning Beyond GPU/TPU Memory Constraint

Python 421 26 Updated Mar 26, 2024

Hybrid search engine, combining best features of text and semantic search worlds

Scala 594 16 Updated Jan 6, 2026

MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW

Python 2,849 312 Updated Jan 2, 2026

[ICLR 2023 Oral] Image as Set of Points

Python 574 42 Updated Apr 26, 2024

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook 18,989 2,217 Updated Jul 29, 2024

Chain together LLMs for reasoning & orchestrate multiple large models for accomplishing complex tasks

Python 610 54 Updated Apr 11, 2023

Towards an open source stack for e-commerce search

Ruby 150 32 Updated Oct 8, 2025

An open source implementation of CLIP.

Python 13,219 1,222 Updated Nov 4, 2025

Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptatio…

Python 338 37 Updated Jul 6, 2023

State-of-the-Art Text Embeddings

Python 18,088 2,723 Updated Jan 8, 2026

🎯 Task-oriented embedding tuning for BERT, CLIP, etc.

Python 1,508 69 Updated Mar 11, 2024

Simplify deploying and managing Jina projects on Jina Cloud

Python 299 12 Updated Oct 23, 2023

☁️ Build multimodal AI applications with cloud-native stack

Python 21,819 2,241 Updated Mar 24, 2025

A tool for manual classification of dwtc tables. The results are then used as a training data set.

Java 2 1 Updated Jul 25, 2023
HTML 1 Updated Mar 9, 2020
JavaScript 1 2 Updated Jul 19, 2018

A collection of free Bootstrap 5 templates.

3,074 997 Updated Jul 25, 2024

A tool to analyse, browse and query Wikidata

TypeScript 84 17 Updated May 13, 2025

Examples showing how to use Wikidata Toolkit as a Maven library in your project

Java 54 23 Updated Sep 10, 2025

Java library to interact with Wikibase

Java 401 111 Updated Jan 6, 2026

This repo contains tutorials on OpenCV-Python library using new cv2 interface

Python 1,271 881 Updated Apr 25, 2021