Hey everyone, firstly, well done on creating an amazing collection of important literature.
I'm a post-grad linguistics researcher at the University of Wales, Swansea, specialising in children's literature. I'm currently building a corpus of 'classic' children's stories, and using Project Gutenberg to download texts (e.g. Treasure Island, Peter Pan, Little Women, Adventures of Mark Twain). As the texts were published between around 1860 and 1920, there are no copyright issues in downloading them and using them for linguistic analysis.
I'm particularly interested in collections such as the 'Half Dime Library' you have on your site, published during the same period.
Is there a way of extracting just the text, or of downloading rich text files (RTF), without the illustrations?