We rendered and embedded one million CAD files

(cad-search-three.vercel.app)

3 points | by DavidFerris 10 hours ago ago

4 comments

  • DavidFerris 10 hours ago ago

    We rendered the one million part ABC dataset from Deep Geometry, and open-sourced the data. We also built a fun demo with the following pipeline: CAD > render > caption > embed.

    Open-sourced dataset: https://huggingface.co/datasets/daveferbear/3d-model-images-...

    Blog writeup: https://www.finalrev.com/blog/embedding-one-million-3d-model...

  • sho_hn 9 hours ago ago

    Interesting.

    My go-to for CAD files is usually https://grabcad.com/library

    I searched this for "WAGO" and "XT90", so I guess not the same use case. Some hits for "Raspberry Pi", though.

    • DavidFerris 9 hours ago ago

      This isn't meant to be a commercially useful search engine- just a demonstration. You'll only be able to search for terms that the VLM could directly discern.

      From the blog post: Our search demo proves that it works quite well. As anticipated, text search works well, returning sensible results for even irregular or poorly formed queries. It’s worth mentioning that this is very different from 3D part libraries like Thingiverse or GrabCAD. Search in those repositories requires users to tag or annotate parts with a description, the text of which is used in search. Our system takes only an unnamed part as input, requiring no additional labelling.

      • sho_hn 9 hours ago ago

        I see, you did an AI demo of captioning and search over captures specifically for complex geometric objects.

        I guess my interest was more piqued by the "CAD" part.