You can tailor your queries to deal with such situations. In most cases, adding a little weight to the title will result in decent search results. That falls apart with "Manic Miner" but you could fix that with an additional boost for an exact match in the title field. That would ensure the original appears above "Manic Miner Blah Blah Edition".kolbeck wrote: ↑Mon Sep 07, 2020 1:28 pmBut using Elasticsearch does not solve the initial problem. For example, on ZXInfo searching for Jetpac - gives by coincidence the "real" Jetpac as first result - https://zxinfo.dk/search/jetpac
Searching for Manic Miner, the "real" Manic Miner ends up in 18th place. - https://zxinfo.dk/search/manic%20miner -
there's no way of knowing what's the real deal, unless someone manually put it into the engine. And last, sorting just makes this worse
(FYI - ZXInfo sorts search results, based in scoring by Elasticsearch - which more or less is an indication of how good the fit was)
/T
Another option would be to apply a small negative score to that phrase (or similar) appearing in the 'Mod' field, so that all mods & clones are pushed down the search results. That would help in situations like, for example, the user searches for "Jet Set Willy" and gets Jet Set Willy first as expected, but Jet Set Willy II appears beneath a load of clones & mods because the 'match_phrase' boost hasn't kicked in. Or because someone wrote "Jetset Willy".
Elasticsearch isn't a black box, it just requires a little massaging and knowledge of the data to return the results you want.