The International Conference on Computer Vision (ICCV 2021)
will be hosted virtually next week. We’re excited to share all the work from SAIL that will be presented, and you’ll find links to papers, videos and blogs below. Feel free to reach out to the contact authors directly to learn more about the work that’s happening at Stanford!
List of Accepted Papers
GLoRIA: A Multimodal Global-Local Representation Learning Framework for Label-efficient Medical Image Recognition
Authors: Mars Huang
Contact: mschuang@stanford.edu
Keywords: medical image, self-supervised learning, multimodal fusion
3D Shape Generation and Completion Through Point-Voxel Diffusion
Authors: Linqi Zhou, Yilun Du, Jiajun Wu
Contact: linqizhou@stanford.edu
Links: Paper | Video | Website
Keywords: diffusion, shape generation
CAPTRA: CAtegory-level Pose Tracking for Rigid and Articulated Objects from Point Clouds
Authors: Yijia Weng*, He Wang*, Qiang Zhou, Yuzhe Qin, Yueqi Duan, Qingnan Fan, Baoquan Chen, Hao Su, Leonidas J. Guibas
Contact: yijiaw@stanford.edu
Award nominations: Oral Presentation
Links: Paper | Video | Website
Keywords: category-level object pose tracking, articulated objects
Detecting Human-Object Relationships in Videos
Authors: Jingwei Ji, Rishi Desai, Juan Carlos Niebles
Contact: jingweij@cs.stanford.edu
Links: Paper
Keywords: human-object relationships, video, detection, transformer, spatio-temporal reasoning
Geography-Aware Self-Supervised Learning
Authors: Kumar Ayush, Burak Uzkent, Chenlin Meng, Kumar Tanmay, Marshall Burke, David Lobell, Stefano Ermon
Contact: kayush@cs.stanford.edu, chenlin@stanford.edu
Links: Paper | Website
Keywords: self-supervised learning, contrastive learning, remote sensing, spatio-temporal, classification, object detection, segmentation
HuMoR: 3D Human Motion Model for Robust Pose Estimation
Authors: Davis Rempe, Tolga Birdal, Aaron Hertzmann, Jimei Yang, Srinath Sridhar, Leonidas Guibas
Contact: drempe@stanford.edu
Award nominations: Oral Presentation
Links: Paper | Website
Keywords: 3d human pose estimation; 3d human motion; generative modeling
Learning Privacy-preserving Optics for Human Pose Estimation
Authors: Carlos Hinojosa, Juan Carlos Niebles, Henry Arguello
Contact: carlos.hinojosa@saber.uis.edu.co
Links: Paper | Website
Keywords: computational photography; fairness, accountability, transparency, and ethics in vision; gestures and body pose
Learning Temporal Dynamics from Cycles in Narrated Video
Authors: Dave Epstein, Jiajun Wu, Cordelia Schmid, Chen Sun
Contact: jiajunwu@cs.stanford.edu
Links: Paper | Website
Keywords: multi-modal learning, cycle consistency, video
Vector Neurons: A General Framework for SO(3)-Equivariant Networks
Authors: Congyue Deng, Or Litany, Yueqi Duan, Adrien Poulenard, Andrea Tagliasacchi, Leonidas Guibas
Contact: congyue@stanford.edu
Links: Paper | Video | Website
Keywords: pointcloud network, rotation equivariance, rotation invariance
Neural Radiance for 4D View Synthesis and Video Processing
Authors: Yilun Du, Yinan Zhang, Hong-Xing Yu, Joshua B. Tenenbaum, Jiajun Wu
Contact: jiajunwu@cs.stanford.edu
Links: Paper | Website
Keywords: 4d representation, neural rendering, video processing
Where2Act: From Pixels to Actions for Articulated 3D Objects
Authors: Kaichun Mo, Leonidas J. Guibas, Mustafa Mukadam, Abhinav Gupta, Shubham Tulsiani
Contact: kaichunm@stanford.edu
Links: Paper | Website
Keywords: 3d computer vision, robotic vision, affordance learning, robot learning
Low-Shot Validation: Active Importance Sampling for Estimating Classifier Performance on Rare Categories
Authors: Fait Poms*, Vishnu Sarukkai*, Ravi Teja Mullapudi, Nimit S. Sohoni, William R. Mark, Deva Ramanan, Kayvon Fatahalian
Contact: sarukkai@stanford.edu
Links: Paper | Blog | Video
Keywords: model evaluation, active learning
We look forward to seeing you at ICCV 2021!