Hi,
Can someone please tell me:
a) how to get HTMLTidy to clean metadata field input (or input to any other non-WYSIWYG field); or
b) whether post-3.6 versions of Matrix already use HTMLTidy for metadata field input.
I have an XHTML 1.0 non-compliance issue at the moment, caused by extended ASCII characters from Word appearing in metadata and in content text extracted from asset metadata (for asset listings, search, etc.).
I've searched the forums and bug tracker; I haven't yet resorted to downloading 3.10 and reading the change log …
Cheers,
Currently, HTML Tidy is only used on WYSIWYG content. It is not invoked for any other data within Matrix. You would need to fund additional development for this to happen.
However, I'm not sure that HTML Tidy is the answer here – you shouldn't have HTML in your metadata fields. It may be that we just need to ensure that for metadata fields, any non-standard characters are converted to the correct HTML entity.
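To illustrate the kind of conversion meant here, below is a minimal sketch in Python. It is not Matrix code and not how Matrix handles metadata; the function names are made up for the example. It assumes the Word characters arrive either as proper Unicode (e.g. curly quotes) or as Windows-1252 bytes that were misread as Latin-1, which leaves them in the U+0080 to U+009F range.

```python
def normalise_word_characters(text: str) -> str:
    """Map C1 control characters to their Windows-1252 equivalents.

    Assumption: any character in U+0080..U+009F is a Windows-1252 byte
    (such as a Word "smart quote") that was decoded as Latin-1.
    """
    out = []
    for ch in text:
        if 0x80 <= ord(ch) <= 0x9F:
            try:
                ch = bytes([ord(ch)]).decode("cp1252")
            except UnicodeDecodeError:
                pass  # byte is undefined in Windows-1252; leave it unchanged
        out.append(ch)
    return "".join(out)


def to_numeric_entities(text: str) -> str:
    """Replace every non-ASCII character with a decimal character reference."""
    return text.encode("ascii", "xmlcharrefreplace").decode("ascii")


def clean_metadata_value(text: str) -> str:
    """Hypothetical helper: normalise Word characters, then entity-encode."""
    return to_numeric_entities(normalise_word_characters(text))


if __name__ == "__main__":
    print(clean_metadata_value("\u201cQuoted\u201d caf\u00e9"))
```

Note that this only touches non-ASCII characters. It does not escape &, < or >, so it would not double-encode entities that are already present in a field, but it also would not clean up stray markup.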
I would recommend logging a bug stating that non-standard characters in metadata fields are not properly converted to HTML entities.
Done: http://bugs.matrix.squiz.net/view_bug.php?bug_id=1553
Cheers,