CN1826632A

5G,4G,3G,2G

Title

System and method for combined frequency-domain and time-domain pitch extraction for speech signals

Application Number:

CN20048008861

Publication Date:

30-08-2006

Current Assignee:

IBM

Family ID:

Application Date:

31-03-2004

Declaring Company:

Publication Country:

US

Priority Date:

31-03-2003

Title

System and method for combined frequency-domain and time-domain pitch extraction for speech signals

Application Number:

CN20048008861

Family ID:

Publication Country:

US

Publication Date:

30-08-2006

Application Date:

31-03-2004

Priority Date:

31-03-2003

Current Assignee:

IBM

Declaring Company:

Abstract  Abstract

The invention claims a system computer readable medium and method for voice signal is sampled the sampled voice signal is divided into overlapping frames using frequency domain analysis from the frame extracting first pitch information; providing at least one pitch candidate from the first pitch information. wherein each pitch candidate value score is combined with the spectrum possible pitch estimate of the at least one pitch candidate values each of which represents the frame using time domain analysis from the frame extracting second pitch information from the pitch information providing relevant score of the at least one pitch candidate value; and representing the pitch of the frame high value selecting one of the at least one pitch candidate values. The system computer readable medium and method for speech coding and distributed speech recognition.

A system computer readable medium and method for sampling a speech signal; dividing the sampled speech signal into overlapped frames; extracting first pitch information from a frame using frequency domain analysis; providing at least one pitch candidate each being associated with a spectral score from the first pitch information each of the at least one pitch candidate representing a possible pitch estimate for the frame; extracting second pitch information from the frame using a time domain analysis; providing a correlation score for the at least one pitch candidate from the second pitch information; and selecting one of the at least one pitch candidate to represent the pitch estimate of the frame. The system computer readable medium and method are suitable for speech coding and for distributed speech recognition.

Note:

The information in blue was extracted from the third parties (Standard Setting Organisation, Espacenet)

The information in grey was provided by the patent holder

The information in purple was extracted from the FrandAvenue

Explicitly disclosed patent:openly and comprehensibly describes all details of the invention in the patent document.

Implicitly disclosed patent:does not explicitly state certain aspects of the invention, but still allows for these to be inferred from the information provided.

Basis patent:The core patent in a family, outlining the fundamental invention from which related patents or applications originate.

Family member:related patents or applications that share a common priority or original filing.