Uploaded image for project: 'Confluence Data Center'
  1. Confluence Data Center
  2. CONFSERVER-39441

File Preview Macro Showing Garbled Characters for Office Documents Containing Certain Special Characters

      colored text

      NOTE: This bug report is for Confluence Server. Using Confluence Cloud? See the corresponding bug report.

      Summary

      Certain special characters inside MS office attachment such as Word (both .doc or .docx) and Excel (.xls) are not displaying properly when viewed in the new File Preview macro (newly introduced in 5.7.x)

      Steps to Reproduce

      Steps to reproduce:

      1. Create a MS Office file containing special characters:
      2. Create a page
      3. Edit the page, Insert > File & Image > insert the file > save page
      4. Preview the Excel file. The special characters will be garbled:

      or

      1. Add a page, insert the macro Attachments
      2. Add the sample attachments to the page
      3. Save the page, click on "Preview"
      4. The special characters are garbled

      Expected Results

      All characters are displayed correctly

      Actual Results

      Some characters are garbled and not displayed properly

      Notes

      This issue is not reproducible in

      1. PDF file
      2. While clicking "View" in the attachment macro (Using the Office Viewer instead of View File Macro)

      Characters observed to be causing this issue:

      1. Umlaut characters like "ß" .
      2. Polish characters like "ą"
      3. All Cyrillic characters
      4. All Arabic characters

      Workaround

      Try the resolution provided in the following documentation: The Text in a PowerPoint, Excel or Word Document is Missing or Looks Different when Using the Viewfile Macro

        1. Bug.png
          57 kB
          Janet Albion
        2. encodingtest.docx
          38 kB
          Peter Andersen
        3. encodingtest.xlsx
          39 kB
          Peter Andersen
        4. encodingtest-excel-outcome.png
          83 kB
          Peter Andersen
        5. encodingtest-excel-outcome.png
          83 kB
          Peter Andersen
        6. german-umlauts.xls
          6 kB
          Janet Albion
        7. polish-characters.docx
          13 kB
          Monique Khairuliana
        8. Sample.png
          45 kB
          Janet Albion
        9. screenshot-1.png
          15 kB
          Monique Khairuliana
        10. screenshot-2.png
          150 kB
          Shirley Jhirad
        11. testcyrillic.doc
          23 kB
          Monique Khairuliana
        12. образец нс протокол опроса пострадавшего, очевидца.doc
          40 kB
          Anton Shaleev
        13. العَرَبِية‎.docx
          11 kB
          Monique Khairuliana

            [CONFSERVER-39441] File Preview Macro Showing Garbled Characters for Office Documents Containing Certain Special Characters

            Zac Xu added a comment -

            We are closing this ticket as we believe a fix is available for most customers experiencing this issue in Confluence 7.10+. This issue can actually be broken down into few separate issues.

            1. Garbled single byte Western characters on Linux

            We believe this was resolved with Linux installer changes in Confluence 7.10 that install missing fonts when Confluence is installed. If you are still experiencing this issue after upgrading to Confluence 7.10, please raise a support ticket and include details of your OS name, OS version and include a sample file and we’ll look into it.

            2. Garbled multi-byte CJK characters on Linux

            This should be partially fixed since 7.10 if Confluence is installed through the Linux installer. We upgraded the libraries used for Office document processing in 7.10 and improved the Linux installer script to install some fallback fonts which improves Confluence compatibility with Office documents with certain CJK fonts on Linux, especially CentOS. There are some edge cases the libraries cannot support for now. If you experience that on Confluence 7.10+, please contact support and include a sample file and specify your OS name and version when raising new issues, so that we could investigate further. It is also worth noting that as far as we have seen, this issue does not occur when Confluence is running on a Windows server. If you are interested in the remaining edge cases for garbled CJK characters please watch https://jira.atlassian.com/browse/CONFSERVER-61040 instead.

            Zac Xu added a comment - We are closing this ticket as we believe a fix is available for most customers experiencing this issue in Confluence 7.10+. This issue can actually be broken down into few separate issues. 1. Garbled single byte Western characters on Linux We believe this was resolved with Linux installer changes in Confluence 7.10 that install missing fonts when Confluence is installed. If you are still experiencing this issue after upgrading to Confluence 7.10, please raise a support ticket and include details of your OS name, OS version and include a sample file and we’ll look into it. 2. Garbled multi-byte CJK characters on Linux This should be partially fixed since 7.10 if Confluence is installed through the Linux installer. We upgraded the libraries used for Office document processing in 7.10 and improved the Linux installer script to install some fallback fonts which improves Confluence compatibility with Office documents with certain CJK fonts on Linux, especially CentOS. There are some edge cases the libraries cannot support for now. If you experience that on Confluence 7.10+, please contact support and include a sample file and specify your OS name and version when raising new issues, so that we could investigate further. It is also worth noting that as far as we have seen, this issue does not occur when Confluence is running on a Windows server. If you are interested in the remaining edge cases for garbled CJK characters please watch https://jira.atlassian.com/browse/CONFSERVER-61040 instead.

            any update ?

            Deleted Account (Inactive) added a comment - any update ?

            We got this issue in 7.3.5 as well,

            Any solution is future release ?

            Deleted Account (Inactive) added a comment - We got this issue in 7.3.5 as well, Any solution is future release ?

            Same problem on 7.4.0 - 6.4 was working fine

            Bernd Schaper added a comment - Same problem on 7.4.0 - 6.4 was working fine

            Mandeep Kaur added a comment - - edited

            We are experiencing this issue after upgrading to 7.3.1. This was working fine in 6.13.0.
            My ppt doesn't have any special characters. It shows garbled text for regular text when previewing ppt

            Mandeep Kaur added a comment - - edited We are experiencing this issue after upgrading to 7.3.1. This was working fine in 6.13.0. My ppt doesn't have any special characters. It shows garbled text for regular text when previewing ppt

            Is there an update to this ticket? We are running 7.2.2 and we are impacted by this bug. It would be nice to have it fixed. Thanks

            Lalyn Shivers added a comment - Is there an update to this ticket? We are running 7.2.2 and we are impacted by this bug. It would be nice to have it fixed. Thanks

            Hi,

            We have the same problem with danish characters ø, æ and å.
            We just get blanks.

            Running Confluence 6.15.4

            Rene Schade added a comment - Hi, We have the same problem with danish characters ø, æ and å. We just get blanks. Running Confluence 6.15.4

            Hello everyone,

            We are also having this issue specifically when previewing pptx. Has anyone tried the workaround provided in this ticket and did it worked? Thanks. 

            We are in Confluence 7.1.1. 

            Lalyn Shivers added a comment - Hello everyone, We are also having this issue specifically when previewing pptx. Has anyone tried the workaround provided in this ticket and did it worked? Thanks.  We are in Confluence 7.1.1. 

            Chihara added a comment -

            Same bug in 6.15.x

            Chihara added a comment - Same bug in 6.15.x

            Hi Everyone,

            Have we tried the resolutions provided from the following documentation:

            Kind Regards,
            Monique

            Monique Khairuliana (Inactive) added a comment - Hi Everyone, Have we tried the resolutions provided from the following documentation: The Text in a PowerPoint, Excel or Word Document Looks Different when Using the Viewfile Macro Kind Regards, Monique

              zxu2@atlassian.com Zac Xu
              jalbion Janet Albion (Inactive)
              Affected customers:
              54 This affects my team
              Watchers:
              48 Start watching this issue

                Created:
                Updated:
                Resolved: