StackOverflow2013

Note that there are some explanatory texts on larger screens.

plurals

POTransliterate any convertible utf8 char into ascii equivalent
primarykey
Id
13614622
data
AcceptedAnswerId
19982111
AnswerCount
5
ClosedDate
CommentCount
9
CommunityOwnedDate
CreationDate
2012-11-28T21:19:18.297
FavoriteCount
7
LastActivityDate
2016-05-30T12:43:54.093
LastEditDate
2017-05-23T11:51:41.777
LastEditorUserId
-1
OwnerUserId
555097
ParentId
0
PostTypeId
1
Score
18
ViewCount
19721
LastEditorDisplayName
text
Body
Is there any good solution out there that does this transliteration in a good manner? I've tried using <code>iconv()</code>, but is very annoying and it does not behave as one might expect. <ul> <li>Using <code>//TRANSLIT</code> will try to replace what it can, leaving everything nonconvertible as "?" </li> <li>Using <code>//IGNORE</code> will not leave "?" in text, but will also not transliterate and will also raise <code>E_NOTICE</code> when nonconvertible char is found, so you have to use iconv with @ error suppressor</li> <li>Using <code>//IGNORE//TRANSLIT</code> (as some people suggested in PHP forum) is actually same as <code>//IGNORE</code> (tried it myself on php versions 5.3.2 and 5.3.13)</li> <li>Also using <code>//TRANSLIT//IGNORE</code> is same as <code>//TRANSLIT</code></li> </ul> It also uses current locale settings to transliterate. WARNING - a lot of text and code is following! Here are some examples: <pre><code>$text = 'Regular ascii text + čćžšđ + äöüß + éĕěėëȩ + æø€ + $ + ¶ + @'; echo ' original: ' . $text; echo ' regular: ' . iconv("UTF-8", "ASCII//TRANSLIT", $text); //> regular: Regular ascii text + ????? + ???ss + ?????? + ae?EUR + $ + ? + @ setlocale(LC_ALL, 'en_GB'); echo ' en_GB: ' . iconv("UTF-8", "ASCII//TRANSLIT", $text); //> en_GB: Regular ascii text + cczs? + aouss + eeeeee + ae?EUR + $ + ? + @ setlocale(LC_ALL, 'en_GB.UTF8'); // will this work? echo ' en_GB.UTF8: ' . iconv("UTF-8", "ASCII//TRANSLIT", $text); //> en_GB.UTF8: Regular ascii text + cczs? + aouss + eeeeee + ae?EUR + $ + ? + @ </code></pre> Ok, that did convert č ć š ä ö ü ß é ĕ ě ė ë ȩ and æ, but why not đ and ø? <pre><code>// now specific locales setlocale(LC_ALL, 'hr_Hr'); // this should fix croatian đ, right? echo ' hr_Hr: ' . iconv("UTF-8", "ASCII//TRANSLIT", $text); // wrong > hr_Hr: Regular ascii text + cczs? + aouss + eeeeee + ae?EUR + $ + ? + @ setlocale(LC_ALL, 'sv_SE'); // so this will fix swedish ø? echo ' sv_SE: ' . iconv("UTF-8", "ASCII//TRANSLIT", $text); // will not > sv_SE: Regular ascii text + cczs? + aouss + eeeeee + ae?EUR + $ + ? + @ //this is interesting setlocale(LC_ALL, 'de_DE'); echo ' de_DE: ' . iconv("UTF-8", "ASCII//TRANSLIT", $text); //> de_DE: Regular ascii text + cczs? + aeoeuess + eeeeee + ae?EUR + $ + ? + @ // actually this is what any german would expect since ä ö ü really is same as ae oe ue </code></pre> Lets try with <code>//IGNORE</code>: <pre><code>echo ' ignore: ' . iconv("UTF-8", "ASCII//IGNORE", $text); //> ignore: Regular ascii text + + + + + $ + + @ //+ E_NOTICE: "Notice: iconv(): Detected an illegal character in input string in /var/www/test.server.web/index.php on line 49" // with translit? echo ' ignore/translit: ' . iconv("UTF-8", "ASCII//IGNORE//TRANSLIT", $text); //same as ignore only> ignore/translit: Regular ascii text + + + + + $ + + @ //+ E_NOTICE: "Notice: iconv(): Detected an illegal character in input string in /var/www/test.server.web/index.php on line 54" // translit/ignore? echo ' translit/ignore: ' . iconv("UTF-8", "ASCII//TRANSLIT//IGNORE", $text); //same as translit only> translit/ignore: Regular ascii text + cczs? + aouss + eeeeee + ae?EUR + $ + ? + @ </code></pre> Using <a href="https://stackoverflow.com/a/6857767/555097">solution of this guy</a> also does not work as wanted: <code>Regular ascii text + YYYYY + aous + eYYYeY + aoY + $ + � + @</code> Even using PECL intl <a href="http://www.php.net/manual/en/class.normalizer.php" rel="nofollow noreferrer">Normalizer</a> class (which is not awailable always even if you have PHP > 5.3.0, since ICU package intl uses may not be available to PHP i.e. on certain hosting servers) produces wrong result: <pre><code>echo ' normalize: ' .preg_replace('/\p{Mn}/u', '', Normalizer::normalize($text, Normalizer::FORM_KD)); //>normalize: Regular ascii text + cczsđ + aouß + eeeeee + æø€ + $ + ¶ + @ </code></pre> So is there any other way of doing this right or the only proper thing to do is to do <code>preg_replace()</code> or <code>str_replace()</code> and define transliteration tables yourself? // appendix: I have found on ZF wiki debate from 2008 about <a href="http://framework.zend.com/wiki/display/ZFPROP/Zend_Filter_Transliteration+-+Martin+Hujer" rel="nofollow noreferrer">proposal for Zend_Filter_Transliterate</a> but project was dropped since in some languages it is not possible to convert (i.e. chinese), but still for any latin- and cyrilic-based language IMO this option should exist.
Tags
<php><utf-8><ascii><iconv><transliteration>
Title
Transliterate any convertible utf8 char into ascii equivalent
singulars
PostAcceptedAnswerId
1. PO
 singulars
 PostTypePostTypeId
 PTAnswer
PostParentId
1. This table or related slice is empty.
PostTypePostTypeId
1. PTQuestion
UserLastEditorUserId
1. USCommunity
UserOwnerUserId
1. USIvan Hušnjak
plurals
PostLinksPostIdRelatedPostId
1. PL
 singulars
 LinkTypeLinkTypeId
 LTLinked
PostLinksRelatedPostIdPostId
1. This table or related slice is empty.
PostsAcceptedAnswerId
1. This table or related slice is empty.
PostsParentIdCreationDate
1. PO
 singulars
 PostTypePostTypeId
 PTAnswer
2. PO
 singulars
 PostTypePostTypeId
 PTAnswer
VotesPostIdCreationDate
1. VO
 singulars
 PostPostId
 POTransliterate any convertible utf8 char into ascii equivalent
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
2. VO
 singulars
 PostPostId
 POTransliterate any convertible utf8 char into ascii equivalent
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
3. VO
 singulars
 PostPostId
 POTransliterate any convertible utf8 char into ascii equivalent
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
CommentsPostId

Querying!

Guidance

A row detail

Detail views are divided into sections. All the information in the data section comes from columns in the selected row. The other sections display data from other, related rows.

Related data can be related in a to-one or a to-many fashion. Captions of data related in a to-many fashion link to a list view showing a filtered view of the table.

Try moving around until you find a non-empty to-many entry and click on the label to get to one. You can move back to the root by clicking on the database name in the header.