The corpus involves 400 native Japanese speakers, balanced for gender. The recorded content is varied and covers several domains: generic command and control, human-machine interaction, smart home, and in-car. The transcriptions have been manually proofread to ensure high accuracy.

Format: 16 kHz, 16-bit, uncompressed WAV, mono channel.
Environment: moderately quiet indoor environment, without echo.
Recording content: generic category; human-machine interaction category; smart home command and control category; in-car command and control category; numbers.
Demographics: 464 Japanese speakers, 53% of whom are female.
Device: Android mobile phone, iPhone.
Language: English.
Applications: speech recognition, voiceprint recognition.
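
The audio format stated above (16 kHz, 16-bit, mono, uncompressed WAV) can be checked file by file before training. The following is a minimal sketch, assuming Python's standard `wave` module; the function name `matches_corpus_format` and the file name in the usage note are hypothetical and not part of the corpus documentation.

```python
import wave

# Expected values taken from the "Format" field of the corpus description.
EXPECTED_RATE_HZ = 16000          # 16 kHz sampling rate
EXPECTED_SAMPLE_WIDTH_BYTES = 2   # 16-bit samples
EXPECTED_CHANNELS = 1             # mono


def matches_corpus_format(path):
    """Return True if the WAV file at `path` is 16 kHz, 16-bit, mono, uncompressed PCM."""
    with wave.open(path, "rb") as wav:
        return (
            wav.getframerate() == EXPECTED_RATE_HZ
            and wav.getsampwidth() == EXPECTED_SAMPLE_WIDTH_BYTES
            and wav.getnchannels() == EXPECTED_CHANNELS
            and wav.getcomptype() == "NONE"  # "NONE" means uncompressed PCM
        )
```

For example, `matches_corpus_format("sample.wav")` (a hypothetical file name) returns `False` for a stereo or 8 kHz recording, which can flag files that need resampling or downmixing before use.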