2012-02-08, 17:30:42
geroyche Wrote: on second thought... might it be the html entities that throw your indexer off?
That's exactly what you need to take care of. The Search plugin can't tell whether content should be indexed as text that happens to include < and >, or as HTML, so you have to make sure that the text you pass in is plain text with no tags and no HTML entities.
For HTML, for example, use:
Code:
html_entity_decode(strip_tags($content), ENT_QUOTES, 'UTF-8');
If the content was stored escaped (slashes added and special characters encoded), undo that first:
Code:
html_entity_decode(strip_tags(htmlspecialchars_decode(stripslashes($content))), ENT_QUOTES, 'UTF-8');
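To see what the two calls above do to real input, here is a minimal sketch. The function names to_plain_text() and to_plain_text_escaped() and the sample strings are made up for illustration; they are not part of the Search plugin. The second sample assumes the content was stored with added slashes and htmlspecialchars() encoding.

```php
<?php
// Hypothetical helper (not a Search plugin function): plain HTML in, plain text out.
function to_plain_text(string $content): string
{
    // Strip tags first, then decode entities, so a decoded "<" is never
    // mistaken for the start of a tag.
    return html_entity_decode(strip_tags($content), ENT_QUOTES, 'UTF-8');
}

// Hypothetical helper for content assumed to be stored slash-escaped and
// htmlspecialchars()-encoded.
function to_plain_text_escaped(string $content): string
{
    // Undo the slashes and the outer encoding to get the real HTML back,
    // then clean it the same way as above.
    $html = htmlspecialchars_decode(stripslashes($content));
    return html_entity_decode(strip_tags($html), ENT_QUOTES, 'UTF-8');
}

$html = '<p>Fish &amp; chips: 1 &lt; 2</p>';
echo to_plain_text($html), "\n";            // prints: Fish & chips: 1 < 2

$stored = "&lt;p&gt;Tom\\'s &amp;amp; Jerry&lt;/p&gt;";
echo to_plain_text_escaped($stored), "\n";  // prints: Tom's & Jerry
```

Note the order: if you decode the entities before stripping tags, a literal &lt; in the text turns into < and strip_tags() may then cut off everything after it.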