Shreshth Saini

Applied Scientist Intern | Amazon - Perception Team
Seattle, Washington | June 2024 – August 2024

Worked with the Perception team on large-scale synthetic data generation
Developed novel edit-bench and T2I-based diffusion model for consistent image/video editing and generation
Aiming to conduct Image+Video editing challenge and workshop

Research Intern | Alibaba Group
Sunnyvale, California | January 2024 – May 2024

Developed generalizable and robust Vision Model-based Video Quality Assessment (VQA) methods
Using Diffusion Model priors as perceptual consistency for IQA (Paper: under review)

Co-Founder | Short-X
Austin, Texas | January 2023 – January 2024

Short-X aims to automate the arduous task of making short-form contents from traditional long-form content
Built core AI models and pipelines for Short-X, working on transcription, extracting semantically meaningful and unique highlights, removing pauses, identifying speaker and smart vertical cropping

Graduate Research Assistant | Laboratory for Image and Video Engineering, UT Austin
Austin, Texas | August 2022 – Present

Developing scalable vision models for HDR videos for tasks like ITM/TM, gamut expansion & quality assessment
Created the largest HDR-SDR dataset for short-form videos (publicly available)
Developing video quality assessment methods for HDR videos, which uses Non-Linear expansion of extremes of sub-level luminance

Machine Learning Engineer | BioMind (Products)
Singapore, Singapore | February 2022 – June 2022

Developed SOTA multimodal DL models for segmentation and classification of 25+ tumor/non-tumor classes
Exploited TFRecords for memory-intense 4D datasets and proposed multi-task model for tumor predictions

Research Engineer – AI | Arkray, Inc.
Kyoto, Japan (Remote) | August 2020 – December 2021

Proposed semi-supervised DL models to learn from a large chunk of the private unlabelled and noisy 2D datasets
Deployed models for products: UrineSediment Analyzer, and automated BodyFluid Analyzer (Aution EYE)

Research Assistant | National University of Singapore
Singapore | May 2019 – July 2019
Supervisor: Dr. Mengling 'Mornin' Feng

Developed novel deep learning architecture for large-scale public health datasets
Published SOTA results with low cost for skin lesion analysis

Undergraduate Researcher | Image Processing and Computer Vision Lab, IIT Jodhpur
Jodhpur, India | August 2018 – August 2020
Supervisor: Dr. Anil Kumar Tiwari

Worked on developing ML methods aimed for AI-based diagnosis and treatment support
Developed DL models for retinal vessel & skin lesion segmentation, and diagnosis of left-atrium in 3D GE-MRIs

Research Intern | The Multimedia Analytics, Networks and Systems Lab, IIT Mandi
Mandi, India | May 2018 – July 2018
Supervisor: Dr. Aditya Nigam

Developed novel CNN model for iris segmentation which uses cascaded hourglass modules at the bottleneck of encoder-decoder design