I'm trying to parse a URL with jsoup. The page contains the following text: Ætterni. After parsing the document, the same line looks like this: &AElig;tterni.
How can I prevent this? I want the saved document to be a 1:1 copy, exactly the same as the original.
The code:
doc = Jsoup.connect(url).get();
String docEncoding=doc.outputSettings().charset().name();
OutputStreamWriter writer = new OutputStreamWriter(new FileOutputStream(localLink),docEncoding);
writer.write(doc.html());
writer.close();
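
If the goal is only a byte-for-byte copy, one option that avoids the escaping altogether is to save the raw response without running it through the parser. The following is a minimal sketch under that assumption, using only the JDK; `RawPageSaver` and `saveRaw` are hypothetical names introduced for illustration and are not part of jsoup.

```java
import java.io.IOException;
import java.io.InputStream;
import java.net.URL;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;

// Hypothetical helper (not a jsoup class): stores a page exactly as received.
class RawPageSaver {

    // Copies the stream byte for byte, so no character is re-encoded
    // or turned into an HTML entity. Returns the number of bytes written.
    static long saveRaw(InputStream in, Path target) throws IOException {
        return Files.copy(in, target, StandardCopyOption.REPLACE_EXISTING);
    }

    // Convenience overload: opens the URL and saves its body unchanged.
    static long saveRaw(URL url, Path target) throws IOException {
        try (InputStream in = url.openStream()) {
            return saveRaw(in, target);
        }
    }
}
```

A call such as `RawPageSaver.saveRaw(new URL(url), Paths.get(localLink))` would then replace the writer code above. If the parsed document is still needed, jsoup's `Document.OutputSettings` exposes `escapeMode(...)` and `charset(...)`, which control how characters are escaped on output. Even with those set, the result may not be an exact 1:1 copy, because the parser also normalizes the markup.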