To test several pitch extraction algorithms a common used database is, among others, the Keele Database.
Part of my past research activities was devoted to re-estimate the keele pitch labels basing on the original laryngograph (lx) data. I refer to the new pitch labels as the "corrected" pitch values.
The reason for using the term "corrected" in inverted commas, stems from a very interesting discussion I had with Dr. George Meyer, from Keele University.
When I asked Dr. Meyers his opinion on the advantages of having pitch reference estimate with no doubling/halving errors, he briefly, pointed out that:The following shows the differences between the Keele orginal pitch reference values and the "corrected" one. If you can't display the image or it has too poor resolution, click here!
| keele | -> | the original keele pitch (red line) |
| wautoclxHP | -> | pitch computed by means of wautoc applied to the filtered lx signal |
| wautoclx | -> | pitch computed by means of wautoc applied to the lx signal |
| praatlxHP | -> | pitch computed by praat on the high-pass filtered lx signal and checked by me (black stars) |
| praatSIG | -> | pitch computed by praat on the audio signal (.pes) |
In the specific example above it is evident how the original reference pitch estimate (red line) jumps from 0 to the "correct" pitch value (black stars).
In case you wish to examine other differences between the original Keele pitch estimates and those I obtained, relative to the m1nw0000.pes file, here (10 Mb!) there's the matlab .fig file which allows to zoom in every voiced section.
If you're interested in the "corrected" pitch estimates relative to the whole Keele database, saved the Praat format, download the following: KeeleCorrectedPitch.tar.gz.
