Translated data: CoDA is an end-to-end open-vocabulary 3D object detection framework designed to simultaneously achieve localization and classification of new objects. The framework leverages 3D geometry and 2D semantics to jointly discover new objects in a scene and perform cross-modal feature alignment. Compared to other methods, CoDA can discover more new objects, making it highly valuable academically and promising in practical applications.