Wish list #1
Replies: 5 comments 11 replies
-
With regard to the extended LP text; I'm wondering to which extent LP is still relevant? I know it has been extremely important, but do you see it still used in the future? Personally, I'm undecided. LP-type analysis leads to computational problems, like non-linear modelling and danger of instability etc. whereas operations in the STFT domain are WYSIWYG. For that reason, I've pretty much stopped using LP, but that's just me. The question is more whether there is, for example, solid science with methods combining ML and LP? One thing which I'm planning to add is a vocoder example using LP, where I would apply a formant structure to something like a trumpet sound. That would be a nice demo which simultaneously demonstrates the effect of formants. |
Beta Was this translation helpful? Give feedback.
-
I'm also a bit ambivalent with respect to this. However, on my current speech course, I do teach LPC. More specifically, and after teaching basics of speech production on an earlier lecture, I start from the acoustic theory of speech production (source-filter model) by giving an overview of the concept and reviewing the three components (source, tract, lip radiation). Then I proceed to modeling of vocal tract acoustics with lossless tubes in more detail, and proceed from continuous physical domain to digital lossless tube model(s) of the tract. The motivation is to explain the connection of the physics/physiology and why the tract can be seen as a filter. After those, I present LPC as a practical means to estimate AR model parameters and demonstrate the equivalence of LPC with the lossless tube model. I also went through both autocorrelation and covariance estimators for LP. In our exercises,
So, LPC is also used in the exercises (implemented first, later used for speech synthesis). At least our machine learning oriented students liked this type of "low-level" treatment of the topic a lot. Some commented that they anyway encounter so much "data driven blackbox" stuff on machine learning studies that having "old-school" content is useful. Also, many of them had already had several other courses with statistical signal processing etc. where they have used different stuff from Wiener filters to HMMs, LPC etc., but with limited connection to any physical/domain-specific phenomena. So I got the impression they appreciated the contents, although I did not specifically ask about the LP part. Feedback from the course and exercises (which I asked separately about for each) was extremely good though (clearly above 6 on scale 1–7). Whether any of the LPC stuff is relevant for modern speech tech use is another question... However, at least the aim on my course is to get the students deeply familiar with speech as a phenomenon, and with ways to tackle it with engineering. Not so much about teaching them state-of-the-art machine learning methods in speech. |
Beta Was this translation helpful? Give feedback.
-
Another issue is the table of contents, which has organically grown into what it is now, but I've never put much thought into it. So any suggestions on a better organization of the content? Therefore, one idea could be for example
Edit on 17.6.2022: Added evaluation |
Beta Was this translation helpful? Give feedback.
-
Small thing, but I'd love an easy way to download a PDF of the whole book so I can read it on my Kindle. I only found a way to export to PDF chapter by chapter. |
Beta Was this translation helpful? Give feedback.
-
Reviewing content this year, I've found quite a bit of things which require improvements. Here's a list of things that I'm currently aware of:
I'll continue the list as I progress in reviewing it. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
A collection of ideas from different contributors:
Old / Mostly solved:
Beta Was this translation helpful? Give feedback.
All reactions