Benedict.

Back to Dex
SYSTEM: ACCESSING_RECORD_NVIDIA-RESEARCH...|STATUS: VERIFIED|
Generative AI Researcher Intern
BRAIN_DUMP

"Was shocked to find our original engine would take centuries to generate the dataset we needed, but we pulled through... An awesome virgin experience to multimodal AI research!"

#Generative AI Researcher Intern

>_ Mission Log

[+] MANUSCRIPT IN PREPARATION: Unified Framework for Evaluating Vision Models for Action Recognition in Surveillance Context.

[+] Engineered a scalable synthetic-data engine using open-source datasets (SMPL-X, AMASS, and BABEL) and implemented an auto-labeling mechanism (pose/bbox/mask), which accelerates CV/VLM data collection.

[+] Developed a DORI-aligned surveillance dataset (200k+ images & 630k+ QA pairs) with controlled context parameters to enable robust benchmarking of presence & action recognition models.

[+] Built a unified evaluation framework for human-presence & action recognition, with Hydra configs and Docker/CUDA to benchmark YOLO models against compact LVLMs.

</> Skills & Expertise

PyTorch
CUDA
Blender
LVLMs
Multimodal