Despite encouraging progress in 3D scene understanding, it remains challenging to develop an effective Large Multi-modal Model (LMM) that is capable of understanding and reasoning in complex 3D ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results