Publications

(2024). xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations. arXiv:2408.12590 (also appearing at ECCV24 AI4VA Workshop).

PDF Code

(2024). xGen-MM (BLIP-3): A Family of Open Large Multimodal Models. arXiv preprint arXiv:2408.08872 (2024).

PDF Models

(2023). Diffusion Model Alignment Using Direct Preference Optimization. Conference on Computer Vision and Pattern Recognition (CVPR) 2024 .

PDF Code

(2023). ConRad: Image Constrained Radiance Fields for 3D Generation from a Single Image. Advances in Neural Information Processing Systems 33 (NeurIPS 2023).

PDF Poster

(2021). The Functional Correspondence Problem. International Conference on Computer Vision (ICCV) 2021.

PDF Project Page

(2021). Audio-Visual Floorplan Reconstruction. International Conference on Computer Vision (ICCV) 2021.

PDF Code Talk (5min) Slides Poster

(2016). Pose from Action: Unsupervised Learning of Pose Features based on Motion. Workshop on Action and Anticipation for Visual Learning at ECCV 2016..

PDF Poster

(2016). Stochastic Multiple Choice Learning for Training Diverse Deep Ensembles. Advances in Neural Information Processing Systems (NIPS) 2016.

PDF Poster

(2015). Why M Heads are Better than One: Training a Diverse Ensemble of Deep Networks. arxiv preprint arXiv:1511.06314.

PDF

(2014). Combining the Best of Graphical Models and ConvNets for Semantic Segmentation. arxiv preprint arXiv:1412.4313.

PDF

(2013). Automatic Segmentation of Adipose Tissue from Thigh Magnetic Resonance Images. International Conference on Image Analysis and Recognition (ICIAR) 2013.

PDF