LookOnceToHear is a smart earphone interaction system that lets users select the speaker they want to hear simply by looking at them. The work was nominated for Best Paper at CHI 2024. It performs real-time target speech extraction and relies on synthetic audio mixing, head-related transfer functions (HRTFs), and binaural room impulse responses (BRIRs).
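To make the role of HRTFs and BRIRs in synthetic audio mixing concrete, here is a minimal sketch, not the project's actual pipeline. It assumes each mono source is convolved with a two-channel impulse response (one channel per ear) and the rendered sources are then summed into a binaural mixture. The function names `spatialize` and `mix_binaural` and the toy impulse responses are hypothetical, chosen only for illustration; a real setup would load measured HRTF or BRIR data.

```python
import numpy as np

def spatialize(mono, ir):
    """Convolve a mono signal (n,) with a 2-channel impulse response (2, k).

    Returns a binaural signal of shape (2, n + k - 1).
    """
    return np.stack([np.convolve(mono, ir[ch]) for ch in range(2)])

def mix_binaural(sources, irs):
    """Render each source with its own impulse response and sum the results."""
    rendered = [spatialize(s, h) for s, h in zip(sources, irs)]
    length = max(r.shape[1] for r in rendered)
    mix = np.zeros((2, length))
    for r in rendered:
        mix[:, : r.shape[1]] += r  # zero-pad shorter renders implicitly
    return mix

rng = np.random.default_rng(0)
target = rng.standard_normal(1600)      # stand-in for target speech
interferer = rng.standard_normal(1600)  # stand-in for a competing speaker

# Toy impulse responses (hypothetical, not measured HRTFs/BRIRs).
# Source straight ahead: identical response at both ears.
front = np.zeros((2, 32))
front[:, 0] = 1.0
# Source to the left: the right ear hears it later and quieter.
side = np.zeros((2, 32))
side[0, 0] = 1.0
side[1, 8] = 0.6

mixture = mix_binaural([target, interferer], [front, side])
print(mixture.shape)  # (2, 1631)
```

In this toy setup the frontal source reaches both ears identically while the off-axis source differs between them, which is the kind of spatial cue that binaural rendering is meant to encode.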