• @jmcs@discuss.tchncs.de
    link
    fedilink
    198 months ago

    Besides most text on the average computer is either within some configuration file (which tend to use latin script), or within some SGML derived format which has a bunch of latin characters in it. For network transmission most things will use HTML, XML or JSON and use English language property names even in countries that don’t speak English (see Yandex’s and Baidu’s APIs for example).

    No one is moving large amounts of .txt files around.

    • @Buckshot@programming.dev
      link
      fedilink
      278 months ago

      You’ve never worked in finance then. All our systems at work do nothing but move large amounts of txt files around.

      That said, many of our clients still don’t support utf-8 so its all ascii and non-latin alphabets are screwed. They can’t even handle characters 128-255 so even stuff like £ is unsupported.

      • LaggyKar
        link
        fedilink
        138 months ago

        That said, many of our clients still don’t support utf-8 so its all ascii and non-latin alphabets are screwed.

        Ah, yes, I heard about that sort of thing. Some bank getting a GDPR complaint because they couldn’t correct the spelling of someone’s name, because their system uses EBCDIC.

        • @fibojoly@sh.itjust.works
          link
          fedilink
          12
          edit-2
          8 months ago

          Its not a joke. I worked for a big european bank network and the software there didn’t know how to translate from EBCDIC to UTF8 because none of the devs writing the software knew enough of the other side (mainframe vs PC) to realise this was an issue.

          Their solution was “if the file has a ? in it when we receive it, it’s probably a £”. Which of course completely breaks down the day you have any other untranslated character.

          I spent fucking weeks explaining this issue and why this was abominable, but apparently this wasn’t enough of an issue for people to fix it. Go figure…