How to convert a webpage (from an intranet file wiki) to an Office document?

I have a set of Wiki pages (MediaWiki style) on my corporate intranet that I would like to convert to Microsoft Office Word documents (or something that I can import into it). I am looking for something that has:

Requirements

  • Keep formatting as much as possible.
  • It does not require changing anything on the server hosting the Wiki (the plugin cannot be added or the configuration files can be changed on my part)
  • The solution can be software (since I am also a developer), in the flavor of Python / C # / C ++, etc.

Exceptions

  • Not like a solution like “Wiki for Acrobat PDF Pro for Microsof Office Word” (since we don’t have Acrobat PDF Pro). In fact, even the non-Pro version (which allows "Save as Microsoft Word online") is not available in my company (a very old version of Adobe Suite). Nevertheless, I can still export the page in pdf format, but it looks good from the Wiki (because some element is too large for A4 format, and additional parts are deleted from the created pdf. We would like them to be included anyway and could play with the "bad" formatting in Word in the end.
  • Since this is an intranet wiki, online solutions go beyond
  • Solutions that imply that I could copy the db Wiki and perform the operation elsewhere (e.g. at home) are also not in scope

Functions

  • Windows, Linux (CentOS)
  • , ,

- , ?

+5
2

- URL- Wiki Word Open Document, ., URL- http://en.wikipedia.org/w/index.php?title=Microsoft_Word&printable=yes " ". - , .

, script VBA, :

Sub OpenFromWiki()

    Documents.Open FileName:= _
        "http://en.wikipedia.org/w/index.php?title=Microsoft_Word&printable=yes", _
         ConfirmConversions:=False, ReadOnly:=True, AddToRecentFiles:=False, _
        PasswordDocument:="", PasswordTemplate:="", Revert:=False, _
        WritePasswordDocument:=""

End Sub
+6

OpenDocument Export Extension, "" OpenDocument, MS Word.

mwlib python, , .

+1

All Articles