Is there any way to search through a document in Matrix? Either Word or PDF…
Not Word documents, but if you have the pdftohtml external tool configured you can have the contents of PDF files indexed. Using this method, you could then find PDF files based on the text of a PDF file.
I did not know that. That's awesome.
MS Word, Excel and Powerpoint documents are all indexed if you add them as the correct asset types. You can even index password restricted Excel documents if you include the password as an attribute of the asset.
Awesome, I didn't know these asset types were indexed as well!
The Antiword third-party application must be installed and associated in "External Tools Configuration" to enable indexing of the text content of Word documents.
Edit: Bad-like grammar ![]()
For indexing the content of MS Word document assets, Matrix uses the Antiword external tool, which needs to be separately downloaded. Should add that because that tool has not been updated in quite a while, it will probably only work with versions of Word up to 2003 (ie. not with Word 2007’s OOXML documents).
MS Word documents are indexed if you have the Antiword tool enabled, but I just looked at the source code for both Excel and Powerpoint document assets, and I can't see any code that allows them to be indexed...
Cool. I did not know this!
Great, this is more ideal for me than indexing Word Documents. Where would i find this tool? I don't see it in the System Tools section...
You need to install it on the web server and then tell Matrix where it is by setting the path in System Config > External Tools Config.
So can anyone confirm if Excel and Powerpoint assets are/aren't indexed? I always thought they were.
They definitely aren't (the contents of the document, that is). :(
Catdoc seems to do Excel, Word and Powerpoint files, I’m not sure if there is anything else out there that is better.
http://www.wagner.pp.ru/~vitus/software/catdoc/