Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
TRELLIS.2: state-of-the-art large 3D generative model (4B) (github.com/microsoft)
81 points by dvrp 1 day ago | hide | past | favorite | 16 comments




I'm surprised at the lukewarm reception. Admittedly I don't follow the image-to-3D space as much, but last time I checked in, the gloopy fuzzy outputs did not impress me.

I want to highlight what I believe is the coolest innovation: their novel O-Voxel data structure. I'm still trying to wrap my head around how they figured out the conversion from voxel-space to mesh-space. Those two worlds don't work well together.

A 2D analogy is that they figured out an efficient, bidirectional, one-shot method of converting PNG's into SVG's, without iteration. Crazy.


As an old guy

"System: The code is currently tested only on Linux."

:)


TRELLIS 1 had a massive impact on the research in this area, not least because it’s actually open (full dataset, training and inference). Research like SynCity or PhysX-3D (not the NVIDIA one) wouldn’t have been possible.

Excites for the follow ups for this new generation.


State of the art for Open Source. This is a nice improvement, but far out I cannot wait for a Sparc3D equivalent for local use. Its a step change in quality. I really hope Hunyuan3D-3 is the one to level up to that quality now

To me it seems Trellis 2 has higher quality and it also generates PBR materials and textures.

Where they get training data from?

> All our model, code, and dataset will be publicly released to facilitate reproduction and further research.

Will take some time to publish. The TRELLIS 1 dataset is public here: https://github.com/microsoft/TRELLIS/blob/main/DATASET.md


You can play with a demo here: https://huggingface.co/spaces/microsoft/TRELLIS.2 (requires a hugging face acct)

The results from arbitrary pictures are not nearly as good as what's shown in the posting. So either the demo is running a gimped version of the model or the examples are _very_ handpicked.


Needs 24GB gpu to run.

If it takes 60 seconds on a GPU I can leave it running over night on a CPU. (And going off previous experience, it won't be even be that slow, I'm just being conservative.)


Got a good laugh out of these, thanks.

was fun generating the as well

these are garbage compared to what's shown in the posting.

Project website gives a nice look: https://microsoft.github.io/TRELLIS.2/

Thanks, we'll put that link in the toptext as well.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: