avijit9

🎯

Focusing

Avijit Dasgupta avijit9

🎯

Focusing

PhD @ IIIT Hyderabad

88 followers · 379 following

Hyderabad, India
https://avijit9.github.io/

Achievements

Stars

zhuokaizhao / academia_cv_template

TeX 145 17 Updated Dec 28, 2025

DAVIAN-Robotics / EgoX

Code for "EgoX: Egocentric Video Generation from a Single Exocentric Video"

Python 458 22 Updated Jan 2, 2026

Video-Reason / Awesome-Video-Reasoning

This is a collection of recent papers on reasoning in video generation models.

91 2 Updated Jan 8, 2026

aniket004 / DuoLoRA

DuoLoRA implementation

Python 7 Updated Oct 18, 2025

luigifreda / pyslam

pySLAM is a hybrid Python/C++ Visual SLAM pipeline supporting monocular, stereo, and RGB-D cameras. It provides a broad set of modern local and global feature extractors, multiple loop-closure stra…

Python 2,790 451 Updated Jan 8, 2026

lucas-ventura / chapter-llama

Official PyTorch implementation of the paper "Chapter-Llama: Efficient Chaptering in Hour-Long Videos with LLMs"

Python 87 13 Updated Jun 6, 2025

OpenBMB / MiniCPM-V

MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone

Python 22,592 1,707 Updated Sep 24, 2025

zhengrongz / AoTD

[CVPR 2025] Official PyTorch code of "Enhancing Video-LLM Reasoning via Agent-of-Thoughts Distillation".

Python 54 Updated May 25, 2025

showlab / UniVTG

[ICCV 2023] UniVTG: Towards Unified Video-Language Temporal Grounding

Python 373 34 Updated May 8, 2024

KAIST-Visual-AI-Group / Diffusion-Assignment1-DDPM

Jupyter Notebook 40 32 Updated Feb 8, 2025

pangzhan27 / GTLA

Group-wise Temporal Logit Adjustment for TAS

Python 10 Updated Oct 24, 2024

kuleshov-group / awesome-discrete-diffusion-models

A curated list for awesome discrete diffusion models resources.

524 19 Updated Sep 9, 2025

facebookresearch / vggt

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 12,165 1,288 Updated Oct 11, 2025

sidgairo18 / simple_diffusion_models

Building simple diffusion models for image generation. More so for understanding and learning.

Python 8 2 Updated Mar 30, 2025

DavidZhang73 / TDGV

[WACV'25] Temporal Instructional Diagram Grounding in Unconstrained Videos

Python 5 Updated Dec 17, 2024

anucvml / vidat

Video Annotation Tool

Vue 233 32 Updated Jun 18, 2024

jmhb0 / viddiff

[ICLR 2025] Video Action Differencing

Python 49 2 Updated Jul 3, 2025

presmihaylov / booknotes

A collection of my book notes on various subjects, mainly computer science

Java 2,930 760 Updated Mar 1, 2025

olga-zats / GTDA

[ECCV2024] Gated Temporal Action Anticipation for Stochastic Long-Term Anticipation

Python 22 1 Updated May 29, 2025

zihuixue / AlignEgoExo

Code and data release for the paper "Learning Fine-grained View-Invariant Representations from Unpaired Ego-Exo Videos via Temporal Alignment" (NeurIPS 2023)

Python 19 3 Updated Apr 5, 2024

tovacinni / research-website-template

React + Next.js template for research websites (for PhD students, researchers, etc)

TypeScript 217 90 Updated Jan 12, 2025

yiyixuxu / TimeSformer-rolled-attention

Visualizing the learned space-time attention using Attention Rollout

Jupyter Notebook 41 8 Updated Apr 1, 2022

ml-explore / mlx

MLX: An array framework for Apple silicon

C++ 23,403 1,447 Updated Jan 8, 2026

SimarKareer / EgoMimic

Jupyter Notebook 152 14 Updated Nov 10, 2024

jingyi0000 / VLM_survey

Collection of AWESOME vision-language models for vision tasks

3,052 232 Updated Oct 14, 2025

chalk-diagrams / chalk

A declarative drawing API in Python

Python 298 15 Updated Aug 28, 2024

BoltzmannEntropy / interviews.ai

It is my belief that you, the postgraduate students and job-seekers for whom the book is primarily meant will benefit from reading it; however, it is my hope that even the most experienced research…

4,785 327 Updated Aug 22, 2025

zhengli97 / Awesome-Prompt-Adapter-Learning-for-VLMs

A curated list of awesome prompt/adapter learning methods for vision-language models like CLIP.

739 37 Updated Dec 1, 2025

yenchenlin / nerf-pytorch

A PyTorch implementation of NeRF (Neural Radiance Fields) that reproduces the results.

Python 6,001 1,132 Updated Jul 25, 2024

ViLab-UCSD / LaGTran_ICML2024

Code and models for the ICML 2024 paper "Tell, Don`t Show!: Language Guidance Eases Transfer Across Domains in Images and Videos"

Python 6 1 Updated May 18, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Avijit Dasgupta avijit9

Achievements

Achievements

Block or report avijit9

Stars

zhuokaizhao / academia_cv_template

DAVIAN-Robotics / EgoX

Video-Reason / Awesome-Video-Reasoning

aniket004 / DuoLoRA

luigifreda / pyslam

lucas-ventura / chapter-llama

OpenBMB / MiniCPM-V

zhengrongz / AoTD

showlab / UniVTG

KAIST-Visual-AI-Group / Diffusion-Assignment1-DDPM

pangzhan27 / GTLA

kuleshov-group / awesome-discrete-diffusion-models

facebookresearch / vggt

sidgairo18 / simple_diffusion_models

DavidZhang73 / TDGV

anucvml / vidat

jmhb0 / viddiff

presmihaylov / booknotes

olga-zats / GTDA

zihuixue / AlignEgoExo

tovacinni / research-website-template

yiyixuxu / TimeSformer-rolled-attention

ml-explore / mlx

SimarKareer / EgoMimic

jingyi0000 / VLM_survey

chalk-diagrams / chalk

BoltzmannEntropy / interviews.ai

zhengli97 / Awesome-Prompt-Adapter-Learning-for-VLMs

yenchenlin / nerf-pytorch

ViLab-UCSD / LaGTran_ICML2024