TL;DR: GAGS learns a 3D Gaussian field associated with semantic features, which enables accurate open-vocabulary 3D visual grounding in the scene. Abstract: 3D open-vocabulary scene understanding, ...
Official pytorch code release of "DEEPTalk: Dynamic Emotion Embedding for Probabilistic Speech-Driven 3D Face Animation" @misc{kim2024deeptalkdynamicemotionembedding ...