Hi, I am working on a content migration project where in we are migrating documents (.doc) to sharepoint. Here is the approach we are following.
- Doc is converted into HTML.
- We use some tool to convert HTML to ASPX.
- ASPX is pushed into sharepoint.
I am facing some problem WRT images in document. I face no issues when an image is copied in a document. As I can retrieve this image from the content folder ("FileName_Files"). However I cannot get the image if this is embedded (Insert--> Object--> selected an image and checked the "Display as icon" check box). When I convert the doc file to HTML I get one.emz file and one gif file which is an icon image(in the folder filename_files). I want to retrieve the image which is embedded.
I have tried unzipping this emz file with gzip but it again gave me the icon not the real image.
Can any one help me in this? I just need to be able to retrieve the image.