Ticket #20327 (new Bug)

Opened 4 months ago

Normalize en and em dash

Reported by: ivanteoh Owned by:
Priority: minor Milestone: 4.x
Component: Unknown Version: 4.1
Keywords: Cc:

Description

When a user copies and pastes an en dash or em dash from Microsoft Word into the title field, the dash is converted to "2013" or "2014" (the hexadecimal Unicode code points U+2013 and U+2014) in the generated short name.

>>> site = app['Plone']
>>> from plone.i18n.normalizer.interfaces import IIDNormalizer
>>> from plone.i18n.normalizer.interfaces import IURLNormalizer
>>> from zope.component import getUtility
>>> inor = getUtility(IURLNormalizer)
>>> unor = getUtility(IIDNormalizer)
>>> inor.normalize(u'title — 17.12.2015')
'title-2014-17.12.2015'
>>> unor.normalize(u'title — 17.12.2015')
'title-2014-17-12-2015'
>>> unor.normalize(u'title – 20.12.2015')
'title-2013-20-12-2015'
>>> inor.normalize(u'title – 20.12.2015')
'title-2013-20.12.2015'

Tested on Plone 4.1.6 and Plone 5.0. I assume it also happens on the versions in between.

Is this a bug? Is there any way to work around it? It is confusing for users. Currently, the user has to manually change those characters to '-'.
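
One possible workaround, as a minimal sketch: replace the dashes with an ASCII hyphen before the title reaches the normalizer. The helper name replace_dashes and the DASH_MAP table are hypothetical and not part of plone.i18n.normalizer; where to hook this into Plone is left open, and the sketch assumes the title is available as a unicode string.

```python
# Hypothetical pre-processing helper; not part of plone.i18n.normalizer.
# Maps the code points of the two dashes to an ASCII hyphen.
DASH_MAP = {
    0x2013: u'-',  # EN DASH
    0x2014: u'-',  # EM DASH
}


def replace_dashes(text):
    """Return text with en and em dashes replaced by '-'."""
    return text.translate(DASH_MAP)


# The numbers seen in the short names are the hex code points of the dashes.
en_dash_hex = format(ord(u'\u2013'), 'x')
em_dash_hex = format(ord(u'\u2014'), 'x')

cleaned = replace_dashes(u'title \u2014 17.12.2015')
```

With the dashes replaced first, the string passed to normalize() would contain only a plain hyphen, so the normalizers should no longer see the characters that produce "2013" and "2014".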
