Ticket #906 (closed enhancement: fixed)

Opened 3 years ago

Last modified 2 years ago

Ability to search without accents for accented words

Reported by: Stiivi Owned by: thejimmyg
Priority: awaiting triage Milestone: ckan-sprint-2012-03-05
Component: ckan Keywords:
Cc: amercader Repository: ckan
Theme: none


Users in Slovakia expect sites to support searching without typing accents, for example when they have only a US keyboard rather than a Slovak one. Searching for 'Obyvateľstvo' and 'obyvatelstvo' (population), or for 'štatistika' and 'statistika' (statistics), should yield the same results.

This should work the other way around as well, since some people might enter records without accents into a CKAN instance while others search with accents.

For Slovak, a simple ASCII transliteration of both the search term and the indexed words is sufficient (iconv ASCII//TRANSLIT).
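
To illustrate the idea, here is a minimal Python sketch of folding both the search term and the indexed word to an accent-free form before comparing them. It is only an approximation for discussion: the function names fold_accents and matches are made up for this example and are not part of CKAN, and Unicode decomposition is used as a stand-in for iconv transliteration. It does not cover letters that have no decomposition (such as 'ł' or 'ø'), which is not a concern for the Slovak examples above.

```python
import unicodedata


def fold_accents(text):
    """Return a lowercase, accent-free form of text (hypothetical helper)."""
    # NFKD splits an accented letter into a base letter plus combining marks,
    # e.g. 'š' becomes 's' followed by U+030C (combining caron).
    decomposed = unicodedata.normalize("NFKD", text)
    # Drop the combining marks and keep only the base characters.
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return stripped.casefold()


def matches(query, indexed_term):
    """Compare a query and an indexed term after folding both sides."""
    # Folding both sides makes the match work in either direction:
    # an accented query against unaccented data, and vice versa.
    return fold_accents(query) == fold_accents(indexed_term)


if __name__ == "__main__":
    print(fold_accents("Obyvateľstvo"))
    print(matches("statistika", "štatistika"))
    print(matches("štatistika", "statistika"))
```

Applying the same folding at index time and at query time is what makes the search symmetric; folding only one side would fix one direction and break the other.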

Change History

comment:1 Changed 2 years ago by thejimmyg

  • Owner set to thejimmyg
  • Repository set to ckan
  • Theme set to none
  • Status changed from new to assigned
  • Milestone set to current-ckan-sprint-2011-12-19

comment:2 Changed 2 years ago by rgrp

Do we need to change core code, or just configure Solr?

comment:3 Changed 2 years ago by dread

  • Cc amercader added

This was done in #1701 but only in a custom SOLR schema in the ecportal extension: https://github.com/okfn/ckanext-ecportal/commit/6682926d8895f146cdf1e52ab4fbead9b065af77

Can the ASCIIFoldingFilterFactory be added to core CKAN's SOLR schema for all CKAN users to benefit from?

comment:4 Changed 2 years ago by amercader

Yes, I don't think there will be any problem, and we won't need to create a new version of the schema as the change is backwards compatible.

comment:5 Changed 2 years ago by kindly

  • Status changed from assigned to closed
  • Resolution set to fixed