pa:(Tobias Bocklet)
TECHNOLOGIES FOR ROBUST CRYING DETECTION USING TEMPORAL CHARACTERISTICS OF ACOUSTIC FEATURES
US14979108
[Joachim Hofer, Tobias Bocklet, Georg Stemmer, David Pearce, Sebastian Czyryba, Josef G. Bauer]
DE Munich
Technologies for identifying sounds are disclosed. A sound identification device may capture sound data, and split the sound data into frames. The sound identification device may then determine an acoustic feature vector for each frame, and determine parameters based on how each acoustic feature varies over the duration of time corresponding to the frames. The sound identification device may then determine if the sound matches a pre-defined sound based on the parameters. In one embodiment, the sound identification device may be a baby monitor, and the pre-defined sound may be a baby crying.
更多详情内容请
点击查看
SYSTEMS AND METHODS FOR ENERGY EFFICIENT AND LOW POWER DISTRIBUTED AUTOMATIC SPEECH RECOGNITION ON WEARABLE DEVICES
US15475329
[Binuraj K. Ravindran, Francis M. Tharappel, Prabhakar R. Datta, Tobias Bocklet, Maciej Muchlinski, Tomasz Dorau, Josef G. Bauer, Saurin Shah, Georg Stemmer]
US CA Cupertino
Methods, apparatus, systems and articles of manufacture are disclosed for distributed automatic speech recognition. An example apparatus includes a detector to process an input audio signal and identify a portion of the input audio signal including a sound to be evaluated, the sound to be evaluated organized into a plurality of audio features representing the sound. The example apparatus includes a quantizer to process the audio features using a quantization process to reduce the audio features to generate a reduced set of audio features for transmission. The example apparatus includes a transmitter to transmit the reduced set of audio features over a low-energy communication channel for processing.
更多详情内容请
点击查看
Systems and methods for energy efficient and low power distributed automatic speech recognition on wearable devices
US15475329
[Binuraj K. Ravindran, Francis M. Tharappel, Prabhakar R. Datta, Tobias Bocklet, Maciej Muchlinski, Tomasz Dorau, Josef G. Bauer, Saurin Shah, Georg Stemmer]
US CA Cupertino
Methods, apparatus, systems and articles of manufacture are disclosed for distributed automatic speech recognition. An example apparatus includes a detector to process an input audio signal and identify a portion of the input audio signal including a sound to be evaluated, the sound to be evaluated organized into a plurality of audio features representing the sound. The example apparatus includes a quantizer to process the audio features using a quantization process to reduce the audio features to generate a reduced set of audio features for transmission. The example apparatus includes a transmitter to transmit the reduced set of audio features over a low-energy communication channel for processing.
更多详情内容请
点击查看
Techniques for client-side speech domain detection using gyroscopic data and a system using the same
US15721486
[Piotr Rozen, Tobias Bocklet, Jakub Nowicki, Munir Georges]
PL Gdansk
Techniques are disclosed for client-side analysis of audio samples to identify one or more characteristics associated with captured audio. The client-side analysis may then allow a user device, e.g., a smart phone, laptop computer, in-car infotainment system, and so on, to provide the one or more identified characteristics as configuration data to a voice recognition service at or shortly after connection with the same. In turn, the voice recognition service may load one or more recognition components, e.g., language models and/or application modules/engines, based on the received configuration data. Thus, latency may be reduced based on the voice recognition engine having “hints” that allow components to be loaded without necessarily having to process audio samples first. The reduction of latency may reduce processing time relative to other approaches to voice recognitions systems that exclusively perform server-side context recognition/classification.
更多详情内容请
点击查看
TECHNIQUES FOR CLIENT-SIDE SPEECH DOMAIN DETECTION AND A SYSTEM USING THE SAME
US15721486
[PIOTR ROZEN, TOBIAS BOCKLET, JAKUB NOWICKI, MUNIR GEORGES]
PL Gdansk
Techniques are disclosed for client-side analysis of audio samples to identify one or more characteristics associated with captured audio. The client-side analysis may then allow a user device, e.g., a smart phone, laptop computer, in-car infotainment system, and so on, to provide the one or more identified characteristics as configuration data to a voice recognition service at or shortly after connection with the same. In turn, the voice recognition service may load one or more recognition components, e.g., language models and/or application modules/engines, based on the received configuration data. Thus, latency may be reduced based on the voice recognition engine having “hints” that allow components to be loaded without necessarily having to process audio samples first. The reduction of latency may reduce processing time relative to other approaches to voice recognitions systems that exclusively perform server-side context recognition/classification.
更多详情内容请
点击查看
CONCEALING PHRASES IN AUDIO TRAVELING OVER AIR
US16683686
[Munir Nikolai Alexander Georges, Joachim Hofer, Tobias Bocklet, Josef Bauer, Georg Stemmer]
DE Kehl
An example apparatus for concealing phrases in audio includes a receiver to receive a detected phrase via a network. The detected phrase is based on audio captured near a source of an audio stream. The apparatus also includes a speech recognizer to generate a trigger in response to detecting that a section of the audio stream contains a confirmed phrase. The apparatus further includes a phrase concealer to conceal the section of the audio stream in response to the trigger.
更多详情内容请
点击查看