Formatting WordML to other XML vocabularies using XSLT is fine, but it's not easy and needs good XSLT skills.
Generating other things from PDF is usually very hard. I don't know "tagged PDF".
Michael Kay
http://www.saxonica.com/
Author, XSLT Programmer's Reference and XPath 2.0 Programmer's Reference