python-boilerpipe A python wrapper for Boilerpipe, an excellent Java library for boilerplate removal and fulltext extraction from HTML pages. source: $link[host] Tags: boilerpipe, neotext, programming, python, text-processing Read Original Source