ChatterBox: Multi-round Multimodal Referring and Grounding In this study, we establish a baseline for a new task named multimodal multi-round referring and grounding (MRG), opening up a promising direction for instance-level multimodal dialogues. admin_sagi2024-02-15T05:40:20+00:00February 15, 2024|