I notice that when you create a PDF or Word asset it converts the file to HTML. Where is this converted data kept?
I am looking to create an asset listing that lists PDF files giving the viewer the option of downloading the PDF or viewing it in a new window as HTML (Something similar to how gmail handles attachments)
I can't seem to find where the converted data is kept to be able to recall it, it is not listed as an attribute of the asset.
Any assistance would be greatly appreciated.
[quote]I can’t seem to find where the converted data is kept to be able to recall it, it is not listed as an attribute of the asset.
[right][post=“10375”]<{POST_SNAPBACK}>[/post][/right][/quote]
We do not store the contents in HTML – we convert to HTML briefly in order to index the contents in our search index. We would have to develop something to retain the HTML conversion of PDF and Word documents.
This is because we do not retain any of the source formatting – we’re only interested in the words.