Uploaded image for project: 'Bitbucket Data Center'
  1. Bitbucket Data Center
  2. BSERV-7323

German Umlaut characters display incorrectly in diff view

    XMLWordPrintable

Details

    Description

      Problem description:

      In some cases, based on the file content (aka file encoding), Bitbucket server displays the Umlaut and other non-ASCII characters as either a diamond <?> or another character found in the Russian alphabet. The <?> is usually seen when a non UTF-8 file encoding such as Latin 1 is used without diff transcoding. The incorrect character is normally seen when diff transcoding is on.

      How to reproduce:

      Push the attached repo to bitbucket server and view the diff between master and the german-changes branch.

      Things to be mindful of

      1. Git offers a RAW mode for some commands which can simplify line ending problems
      2. Encoding detection should be done once per file
      3. I had to pass iconv the '-c' option to force it to skip conversion of the euro sign in the file.

      Attachments

        1. bserv-7323-repo.tar.gz
          36 kB
        2. correct.png
          correct.png
          432 kB
        3. correct-latin1.png
          correct-latin1.png
          19 kB
        4. incorrect.png
          incorrect.png
          308 kB
        5. incorrect-latin1.png
          incorrect-latin1.png
          24 kB

        Issue Links

          Activity

            People

              Unassigned Unassigned
              chparker ♚♛♜♝♞♟♔♕♖♗♘♙ (Inactive)
              Votes:
              5 Vote for this issue
              Watchers:
              24 Start watching this issue

              Dates

                Created:
                Updated:

                Backbone Issue Sync