Skip to content
View medhini's full-sized avatar
🎯
🎯

Highlights

  • Pro

Block or report medhini

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

This repo is meant to serve as a guide for Machine Learning/AI technical interviews.

Jupyter Notebook 7,615 1,371 Updated Nov 28, 2025

[ICCV 2023] UniVTG: Towards Unified Video-Language Temporal Grounding

Python 373 34 Updated May 8, 2024
Python 83 6 Updated Jul 16, 2023

Learning and Verification of Task Structure in Instructional Videos

4 Updated Mar 23, 2023

Code for paper, "TL;DW? Summarizing Instructional Videos with Task Relevance & Cross-Modal Saliency" ECCV 2022

Python 39 2 Updated Feb 17, 2023

Audio-conditioned video texture generation

Python 24 1 Updated Sep 16, 2022

This is an official pytorch implementation of Learning To Recognize Procedural Activities with Distant Supervision. In this repository, we provide PyTorch code for training and testing as described…

Python 43 3 Updated Feb 21, 2023

[CVPR'22 Oral] Temporal Alignment Networks for Long-term Video. Tengda Han, Weidi Xie, Andrew Zisserman.

Python 119 4 Updated Oct 9, 2023

PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

Python 1 Updated Mar 15, 2023

S3D Text-Video model trained on HowTo100M using MIL-NCE

Python 200 21 Updated Jul 3, 2020

Easy to use video deep features extractor

Python 322 70 Updated Jul 5, 2020

This is a working demo of the pegasus summarization model trained on cnn_dailymail

Python 29 6 Updated Oct 7, 2020

Unofficial pytorch implementation for Self-critical Sequence Training for Image Captioning. and others.

Python 1,008 276 Updated Oct 5, 2023

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 32,309 3,885 Updated Jul 23, 2024

Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and TIMM models.

Python 643 105 Updated Jan 31, 2025

Source code for "Bi-modal Transformer for Dense Video Captioning" (BMVC 2020)

Jupyter Notebook 230 57 Updated Apr 8, 2023

Unsupervised video summarization with deep reinforcement learning (AAAI'18)

Python 503 152 Updated Dec 11, 2023

A PyTorch implementation of the Transformer model from "Attention Is All You Need".

Python 60 11 Updated Jul 13, 2019

Latex code for making neural networks diagrams

TeX 24,346 3,037 Updated Aug 21, 2023

Simple project webpage template. Originally used in Colorful Image Colorization. ECCV, 2016.

HTML 486 168 Updated Oct 20, 2020

This repository contains the source code for the paper First Order Motion Model for Image Animation

Jupyter Notebook 14,989 3,285 Updated Nov 14, 2024

Script for converting the pretrained VGGish model provided with AudioSet from TensorFlow to PyTorch, along with a basic smoke test.

Python 87 10 Updated May 16, 2019

Instructional notebooks on music information retrieval.

Jupyter Notebook 1,263 417 Updated Jan 16, 2026

PyTorch implementation of Super SloMo by Jiang et al.

Python 3,029 485 Updated Mar 9, 2023

Audio To Body Dynamics, CVPR 2018

Python 119 26 Updated Oct 30, 2018

PyTorch implementations of Generative Adversarial Networks.

Python 17,404 4,101 Updated Jun 18, 2024
Python 4 Updated Dec 7, 2019

AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss

Python 1,088 217 Updated Oct 23, 2024

TensorFlow implementation for audio neural style.

Jupyter Notebook 451 112 Updated Apr 23, 2022

Deep Audio-Visual Embedding network (DAVEnet) implementation in PyTorch

Python 65 19 Updated Aug 31, 2018
Next