
Mixed formatting in single cell in Excel document causes Confluence indexing errors

    • Severity 3 - Minor

      NOTE: This bug report is for Confluence Cloud. Using Confluence Server? See the corresponding bug report, CONFSERVER-23376 (http://jira.atlassian.com/browse/CONFSERVER-23376).

      Steps to reproduce:

      1. Create an Excel spreadsheet and enter some text in a cell
      2. Select part of the text in that cell and change its formatting (e.g. bold, italics, underline)
      3. Save the spreadsheet and upload it to Confluence
      4. Rebuild the search index

      Error in logs:

      2011-10-02 14:23:25,893 ERROR [Indexer: 1] [officeconnector.index.excel.ExcelXMLTextExtractor] endDocument expected [ 1 ] entries but read [ 2 ]
       -- referer: http://localhost:5350/admin/search-indexes.action | url: /admin/reindex.action | userName: admin | action: reindex
      

      An example .xlsx is attached.

      Here are the contents of sharedStrings.xml from the unarchived .xlsx, which appear to contain the relevant data:

      <?xml version="1.0" encoding="UTF-8" standalone="yes"?>
      <sst xmlns="http://schemas.openxmlformats.org/spreadsheetml/2006/main" count="1" uniqueCount="1">
      	<si>
      		<r>
      			<t>foo</t>
      		</r>
      		<r>
      			<rPr>
      				<u/>
      				<sz val="10"/>
      				<rFont val="Verdana"/>
      			</rPr>
      			<t>bar</t>
      		</r>
      		<phoneticPr fontId="1" type="noConversion"/>
      	</si>
      </sst>
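
The report does not show the extractor's code, so the cause is not confirmed. One plausible reading of `expected [ 1 ] entries but read [ 2 ]` is that the extractor counts every `<t>` text node as an entry, while `uniqueCount` counts `<si>` items, and a cell with mixed formatting stores one `<si>` holding several `<r>` runs. The sketch below uses only the Python standard library to illustrate that difference on the XML above. The function names are hypothetical and are not part of Office Connector, and `xl/sharedStrings.xml` is assumed to be the part's location inside the archive, which is where Excel conventionally writes it.

```python
import zipfile
import xml.etree.ElementTree as ET

# SpreadsheetML main namespace, copied from the <sst> element in the report.
MAIN_NS = "http://schemas.openxmlformats.org/spreadsheetml/2006/main"
NS = {"m": MAIN_NS}

# The sharedStrings.xml content quoted in the report (whitespace condensed).
SAMPLE = b"""<?xml version="1.0" encoding="UTF-8" standalone="yes"?>
<sst xmlns="http://schemas.openxmlformats.org/spreadsheetml/2006/main" count="1" uniqueCount="1">
  <si>
    <r><t>foo</t></r>
    <r>
      <rPr><u/><sz val="10"/><rFont val="Verdana"/></rPr>
      <t>bar</t>
    </r>
    <phoneticPr fontId="1" type="noConversion"/>
  </si>
</sst>"""


def read_shared_strings(xlsx):
    """Return the raw bytes of xl/sharedStrings.xml (assumed path) from an .xlsx."""
    with zipfile.ZipFile(xlsx) as archive:
        return archive.read("xl/sharedStrings.xml")


def declared_unique_count(xml_bytes):
    """Read the uniqueCount attribute declared on the <sst> root element."""
    return int(ET.fromstring(xml_bytes).get("uniqueCount"))


def count_text_nodes(xml_bytes):
    """Naive count: every <t> element, so each formatted run counts separately."""
    root = ET.fromstring(xml_bytes)
    return sum(1 for _ in root.iter("{%s}t" % MAIN_NS))


def extract_entries(xml_bytes):
    """One string per <si>: a plain <t>, or the <r>/<t> runs joined in order."""
    root = ET.fromstring(xml_bytes)
    entries = []
    for si in root.findall("m:si", NS):
        plain = si.find("m:t", NS)
        if plain is not None:
            entries.append(plain.text or "")
        else:
            runs = si.findall("m:r/m:t", NS)
            entries.append("".join(t.text or "" for t in runs))
    return entries


if __name__ == "__main__":
    print("declared uniqueCount:", declared_unique_count(SAMPLE))
    print("<t> nodes counted naively:", count_text_nodes(SAMPLE))
    print("entries grouped per <si>:", extract_entries(SAMPLE))
```

Grouping runs by their enclosing `<si>` keeps the number of extracted entries equal to the declared `uniqueCount` for this sample, whereas counting `<t>` nodes does not. To run the same check against the attached file, pass the result of `read_shared_strings("mixed_formatting.xlsx")` in place of `SAMPLE`.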
      

        1. mixed_formatting.xlsx
          28 kB
          Robert Chang

            [AI-467] Mixed formatting in single cell in Excel document causes Confluence indexing errors

            pqz made changes -
            Component/s Original: Search - Core [ 46383 ]
            Component/s New: Search - Core [ 75296 ]
            Fix Version/s Original: 5.8.13 [ 67871 ]
            Key Original: CONFCLOUD-23376 New: AI-467
            Rank (Obsolete) Original: 126370000000
            Symptom Severity New: Severity 3 - Minor [ 14432 ]
            Affects Version/s Original: 5.7.1 [ 67767 ]
            Affects Version/s Original: 5.7 [ 67750 ]
            Affects Version/s Original: 5.6.5 [ 67749 ]
            Affects Version/s Original: 5.6.4 [ 67748 ]
            Affects Version/s Original: 5.5 [ 67726 ]
            Affects Version/s Original: 5.4.3 [ 67714 ]
            Project Original: Confluence Cloud [ 18513 ] New: Atlassian Intelligence [ 23110 ]
            Monique Khairuliana (Inactive) made changes -
            Workflow Original: Confluence Workflow - Public Facing - Restricted v5 - TEMP [ 2368624 ] New: JAC Bug Workflow v3 [ 3427236 ]
            Status Original: Resolved [ 5 ] New: Closed [ 6 ]
            Katherine Yabut made changes -
            Workflow Original: Confluence Workflow - Public Facing - Restricted v5 [ 2246338 ] New: Confluence Workflow - Public Facing - Restricted v5 - TEMP [ 2368624 ]
            Katherine Yabut made changes -
            Workflow Original: Confluence Workflow - Public Facing - Restricted v5.1 - TEMP [ 2203671 ] New: Confluence Workflow - Public Facing - Restricted v5 [ 2246338 ]
            Katherine Yabut made changes -
            Workflow Original: Confluence Workflow - Public Facing - Restricted v5 - TEMP [ 2142034 ] New: Confluence Workflow - Public Facing - Restricted v5.1 - TEMP [ 2203671 ]
            Katherine Yabut made changes -
            Workflow Original: Confluence Workflow - Public Facing - Restricted v5 [ 1890926 ] New: Confluence Workflow - Public Facing - Restricted v5 - TEMP [ 2142034 ]
            Katherine Yabut made changes -
            Workflow Original: Confluence Workflow - Public Facing - Restricted v3 [ 1820171 ] New: Confluence Workflow - Public Facing - Restricted v5 [ 1890926 ]
            jonah (Inactive) made changes -
            Description Original: same text as the new description below, without the NOTE panel.
            New: {panel:bgColor=#e7f4fa}
              *NOTE:* This bug report is for *Confluence Cloud*. Using *Confluence Server*? [See the corresponding bug report|http://jira.atlassian.com/browse/CONFSERVER-23376].
              {panel}

            h3. Steps to reproduce:
            # Create an Excel spreadsheet and enter some text in a cell
            # Select part of that cell and change its formatting (e.g. bold, italics, underline)
            # Save this spreadsheet and upload to Confluence
            # Rebuild search index

            *Error in logs:*
            {code}
            2011-10-02 14:23:25,893 ERROR [Indexer: 1] [officeconnector.index.excel.ExcelXMLTextExtractor] endDocument expected [ 1 ] entries but read [ 2 ]
             -- referer: http://localhost:5350/admin/search-indexes.action | url: /admin/reindex.action | userName: admin | action: reindex
            {code}

            An example .xlsx is attached.

            Here are the contents of sharedStrings.xml when the .xlsx is unarchived, which appears to contain the relevant data:
            {code}
            <?xml version="1.0" encoding="UTF-8" standalone="yes"?>
            <sst xmlns="http://schemas.openxmlformats.org/spreadsheetml/2006/main" count="1" uniqueCount="1">
            <si>
            <r>
            <t>foo</t>
            </r>
            <r>
            <rPr>
            <u/>
            <sz val="10"/>
            <rFont val="Verdana"/>
            </rPr>
            <t>bar</t>
            </r>
            <phoneticPr fontId="1" type="noConversion"/>
            </si>
            </sst>
            {code}
            jonah (Inactive) made changes -
            Link New: This issue is related to CONFSERVER-23376 [ CONFSERVER-23376 ]
            vkharisma made changes -
            Project Import New: Sat Apr 01 14:06:06 UTC 2017 [ 1491055566265 ]

              briosa Blake Riosa (Inactive)
              rchang Robert Chang
              Affected customers: 56
              Watchers: 64