Note that there are some explanatory texts on larger screens.

plurals
  1. POHow to define/declare utf-8 code points for Turkish special chars (non-ascii) to use them as standart utf-8 encoding?
    primarykey
    data
    text
    <p>Türkish chars 'ÇçĞğİıÖöŞşÜü' are not handled correctly in utf-8 encoding altough they all seem to be defined. Charcodes of all of them is 65533 (replacemnt character, possibly for error display) in usage and a question mark or box is displayed depending on the selected font. In some cases 0/null is returned as charcode. On the internet, there are lots of tools which give utf-8 definitions of them but I am not sure if tools use any defined (real/international) registry or dynamicly create the definition with known rules and calculations. Fonts for them are well-defined and no problem to display them when we enter code points manually. This proves that they are defined in utf-8. But on the other hand they are not handled in encodings or tranaformations such as ajax requests/responses.</p> <p>So the base question is "HOW CAN WE DEFINE A CODEPOINT FOR A CHAR"? The question may be tailored as follows to prevent mis-conception. Suppose we have prepared the encoding data for "Ç" like this -> Character : Ç Character name : LATIN CAPITAL LETTER C WITH CEDILLA Hex code point : 00C7 Decimal code point : 199 Hex UTF-8 bytes : C387 ...... Where/How can we save this info to be a standard utf-8 char? How can we distribute/expose it (make ready to be used by others) ? Do we need any confirmation by anybody/foundation (like unicode/utf-8 consortium) How can we detect/fixup errors if they are already registered but not working correctly? Can we have custom-utf8 configuration? If yes how?</p> <p>Note : No code snippet is needed here as it is not mis-usage problem. </p>
    singulars
    1. This table or related slice is empty.
    1. This table or related slice is empty.
    1. This table or related slice is empty.
    plurals
    1. This table or related slice is empty.
    1. COEvery Unicode character, including the Turkish alphabet, can be expressed in UTF-8 encoding. If you are seeing replacement characters, your text is either encoded incorrectly or you are using the wrong encoding to read it. The conversion between Unicode codepoints and UTF-8 byte strings is well-defined and fixed (see http://en.wikipedia.org/wiki/UTF-8). You cannot customize it.
      singulars
    2. COI had read http://en.wikipedia.org/wiki/UTF-8 and many more before asking my question. When I dont give any charset, then page loads nice but ajax requests (except Firefox and Opera) fails. When I give iso-8859-9/windows-1254 page loads good, ajax works good with only Firefox. When I give utf-8, then page cant show special chars but ajax works with all (6 major) browsers. Those problems do not occure with other languages. This shows that there are some irrelevant definitions. By myself there is no problem by doing conversions but I would like Turkish chars to work as well as other languages.
      singulars
    3. CO@İlhanÇELİK The page of this question (stackoverflow) is encoded in UTF-8. Do you see characters used in Turkish correctly or are they broken here too?
      singulars
 

Querying!

 
Guidance

SQuiL has stopped working due to an internal error.

If you are curious you may find further information in the browser console, which is accessible through the devtools (F12).

Reload