https://github.com/subes/invesdwin-webproxy
It supports HttpClient and HtmlUnit (a headless browser with JavaScript support) and, if required, can parallelize requests across a large proxy pool. I can also recommend JSoup for static HTML processing.
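To illustrate the general idea of spreading downloads over a proxy pool in parallel, here is a minimal sketch. It does not use invesdwin-webproxy's API; it relies only on the JDK's built-in `java.net.http.HttpClient` (Java 11+). The class name `ProxyPoolSketch`, its methods, the round-robin strategy, and the timeouts are all assumptions made for this illustration.

```java
import java.net.InetSocketAddress;
import java.net.ProxySelector;
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.time.Duration;
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

// Hypothetical helper for illustration only; not part of invesdwin-webproxy.
public class ProxyPoolSketch {

    /** Picks a proxy for the i-th request by simple round-robin over the pool. */
    static InetSocketAddress proxyFor(int requestIndex, List<InetSocketAddress> pool) {
        return pool.get(requestIndex % pool.size());
    }

    /** Downloads one URL through the given proxy and returns the response body. */
    static String fetch(String url, InetSocketAddress proxy) throws Exception {
        HttpClient client = HttpClient.newBuilder()
                .proxy(ProxySelector.of(proxy))
                .connectTimeout(Duration.ofSeconds(10)) // assumed timeout
                .build();
        HttpRequest request = HttpRequest.newBuilder(URI.create(url))
                .timeout(Duration.ofSeconds(30)) // assumed timeout
                .GET()
                .build();
        return client.send(request, HttpResponse.BodyHandlers.ofString()).body();
    }

    /**
     * Fetches all URLs in parallel, spreading them over the proxy pool.
     * Blocks until every task has finished; a failed download surfaces as an
     * ExecutionException when calling get() on the corresponding Future.
     */
    static List<Future<String>> fetchAll(List<String> urls,
                                         List<InetSocketAddress> pool,
                                         int threads) throws InterruptedException {
        ExecutorService executor = Executors.newFixedThreadPool(threads);
        try {
            List<Callable<String>> tasks = new ArrayList<>();
            for (int i = 0; i < urls.size(); i++) {
                String url = urls.get(i);
                InetSocketAddress proxy = proxyFor(i, pool);
                tasks.add(() -> fetch(url, proxy));
            }
            return executor.invokeAll(tasks);
        } finally {
            executor.shutdown();
        }
    }
}
```

You would call `ProxyPoolSketch.fetchAll(urls, pool, 8)` with your own list of proxies, for example ones built with `InetSocketAddress.createUnresolved("proxy-a.example", 8080)` (a placeholder host), and then pass each returned body to whatever parser you prefer. Note that this sketch only covers plain HTTP downloads; pages that need JavaScript would still require something like HtmlUnit.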
subes source