I recently started a website http://www.tastestalkr.com. I crawl the web looking for mp3 blogs then I index the mp3 blogs frequently. I currently do a bunch of keyword analysis using the wordnet (http://wordnet.princeton.edu) library. I would like to do some more advanced analysis like theme recognition or unique feature extraction. I have read the book "programming for collective intelligence", its a great start, but I wondered if their are any libraries out their that do this already?
* Python preferable, but I'll take what I can get.