

Subject: Re: [docbook-apps] best way to convert docx to DocBook


If you are comfortable using XProc, a *very* robust solution is this:

https://transpect.github.io/

We have used this for years to round-trip among docx, InDesign, HTML, and XML (we have converted 10^5+ pages from docx/IDML/XML), and we are *very* happy with it.

The idea is to convert to and from a central format (Hub XML, which is DocBook + CSS attributes).

The docx2hub module is quite easy to configure and run, and it gives you a flat DocBook document that you can transform further (e.g. to add sectioning, semantic tagging, or cleanup) via the evolve-hub module.
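
To make "adding sectioning to a flat DocBook" concrete, here is a minimal Python sketch. It is not how evolve-hub works (that module is XSLT/XProc based); it only illustrates the idea. It assumes, hypothetically, that heading paragraphs in the flat file carry a role of the form "Heading1", "Heading2", and so on, and that everything sits directly under a root element in the DocBook namespace. The function name add_sectioning is made up for this example.

```python
import re
import xml.etree.ElementTree as ET

DB_NS = "http://docbook.org/ns/docbook"
ET.register_namespace("", DB_NS)

# Assumption: heading paragraphs carry a role such as "Heading1" or "Heading2".
HEADING_ROLE = re.compile(r"^Heading([1-9])$")


def q(name):
    """Return a namespace-qualified DocBook tag name."""
    return f"{{{DB_NS}}}{name}"


def add_sectioning(flat_root):
    """Nest a flat sequence of paragraphs into <section> elements by heading level."""
    new_root = ET.Element(flat_root.tag, flat_root.attrib)
    # Stack of (level, element); the new root acts as level 0 and is never popped.
    stack = [(0, new_root)]
    for child in flat_root:
        match = None
        if child.tag == q("para"):
            match = HEADING_ROLE.match(child.get("role", ""))
        if match:
            level = int(match.group(1))
            # Close every open section at the same or a deeper level.
            while stack[-1][0] >= level:
                stack.pop()
            section = ET.SubElement(stack[-1][1], q("section"))
            title = ET.SubElement(section, q("title"))
            title.text = child.text
            title.extend(list(child))  # keep inline markup inside the heading
            stack.append((level, section))
        else:
            # Ordinary content goes into the innermost open section.
            stack[-1][1].append(child)
    return new_root


if __name__ == "__main__":
    flat = ET.fromstring(
        f'<hub xmlns="{DB_NS}">'
        '<para role="Heading1">Intro</para>'
        "<para>Text A</para>"
        '<para role="Heading2">Detail</para>'
        "<para>Text B</para>"
        "</hub>"
    )
    print(ET.tostring(add_sectioning(flat), encoding="unicode"))
```

In a real setup the mapping from Word style names to heading levels would be configuration rather than a hard-coded pattern.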

For a complete list of the modules, see:
https://github.com/transpect

Regards,
__peppo



On Wed, Apr 14, 2021 at 5:02 PM Michel van den Burg <burg@coutinho.nl> wrote:

Hello everyone,

I would like to know your opinion about this question: What would you consider the best way to get from Microsoft .docx to DocBook these days? We have to work with styled Word documents (every element has been styled, fortunately). We know XMLmind has a converter; there are some GitHub projects (https://github.com/albertjuhe/word2docbook and https://github.com/oxygenxml/oxygen-resources-converter), and I would say oXygen can do this too.

And, can it be done in one transformation, or should we first convert to HTML, for example?

I hope this question isn't inappropriate. Thanks in advance!

Kind regards,

Michel van den Burg

digital publishing | Uitgeverij Coutinho

06 - 10 75 75 14



