I enjoy animating scenes on my O gauge layout, and have come across a great way to get slower, more-precise realistic action: ...
TL;DR: Given a 3D semantic layout, SpatialGen can generate a 3D indoor scene conditioned on either a reference image (left) or a textual description (right) using a multi-view, multi-modal diffusion ...