WP — 023
From Weight-Space Generation to DeepSDF: A Twelve-Phase Image-to-3-D Research Archive, the Weight-Space Dominance Diagnostic, and an Image-Conditioned Latent-Diffusion Pipeline that Works at the 976-Shape Scale
Twelve-phase image-to-3-D project. The weight-space hypothesis (shapes are decoder weights) fails for image-conditioned generation: warm-started per-shape weights are ≥96% shared anchor, the ≤4% per-shape signal is too thin for diffusion to extract — mode collapse survives every ablation. The autoencoder rescue fails too (cos=0.997 yet broken meshes). The DeepSDF pivot constructs a 64-dim latent rather than extracting one, and works: perfect recall + category-appropriate OOD at 976 shapes. Code, 26GB dataset, live HF Space, 30-page thesis public.
May 2026
→