Posts

Articles by Jesse Polhemus

PackUV: Video-Native Representations For Streaming 4D Scenes

A series of images showing how PackUV uses Gaussian UV fitting and produces one representation with all attributes that's 100% video-codec compatible

Imagine watching a concert not from a fixed camera angle, but from any angle. The catch? Volumetric video is incredibly hard to store and stream, and you can have the most photorealistic 4D scene in the world, but if you can't get it to a viewer efficiently, it's stuck in a lab. Our work, PackUV, tackles exactly this problem.

VideoGPA: Distilling Geometry Priors For 3D-Consistent Video Generation

Eight images of a hallway that demonstrate VideoGPA's superiority over a baseline model

In this paper, we leverage a 3D Geometric Foundation Model to build a self-supervised pipeline that evaluates 3D consistency in AI-generated videos. By integrating our video generation model with reinforcement learning, we are able to generate highly 3D-coherent and realistic videos. This approach significantly reduces morphing, flickering, and artifacts, outperforming current state-of-the-art methods.

Back