Confluence Data Center / CONFSERVER-23376

Mixed formatting in single cell in Excel document causes Confluence indexing errors


Details

    Description

      NOTE: This bug report is for Confluence Server. Using Confluence Cloud? See the corresponding bug report.

      Steps to reproduce:

      1. Create an Excel spreadsheet and enter some text in a cell
      2. Select part of the text in that cell and change its formatting (e.g. bold, italics, underline)
      3. Save the spreadsheet and upload it to Confluence
      4. Rebuild the search index

      Error in logs:

      2011-10-02 14:23:25,893 ERROR [Indexer: 1] [officeconnector.index.excel.ExcelXMLTextExtractor] endDocument expected [ 1 ] entries but read [ 2 ]
       -- referer: http://localhost:5350/admin/search-indexes.action | url: /admin/reindex.action | userName: admin | action: reindex
      

      An example .xlsx is attached.

      Here are the contents of sharedStrings.xml from the unarchived .xlsx, which appears to contain the relevant data:

      <?xml version="1.0" encoding="UTF-8" standalone="yes"?>
      <sst xmlns="http://schemas.openxmlformats.org/spreadsheetml/2006/main" count="1" uniqueCount="1">
      	<si>
      		<r>
      			<t>foo</t>
      		</r>
      		<r>
      			<rPr>
      				<u/>
      				<sz val="10"/>
      				<rFont val="Verdana"/>
      			</rPr>
      			<t>bar</t>
      		</r>
      		<phoneticPr fontId="1" type="noConversion"/>
      	</si>
      </sst>
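
      The numbers in the log line match two counts in this file: the header declares one shared string (uniqueCount="1"), while the single <si> entry holds two <t> elements, one per formatting run. One possible reading, which this report does not confirm, is that the extractor counts <t> elements instead of <si> entries. The Python sketch below only illustrates that mismatch and is not the Office Connector code; the helper names summarize and read_shared_strings are hypothetical.

```python
import zipfile
import xml.etree.ElementTree as ET

# SpreadsheetML namespace, taken from the <sst> element in the report.
NS = {"m": "http://schemas.openxmlformats.org/spreadsheetml/2006/main"}

# The sharedStrings.xml content from the report, with whitespace removed.
SHARED_STRINGS = (
    '<?xml version="1.0" encoding="UTF-8" standalone="yes"?>'
    '<sst xmlns="http://schemas.openxmlformats.org/spreadsheetml/2006/main"'
    ' count="1" uniqueCount="1">'
    '<si><r><t>foo</t></r>'
    '<r><rPr><u/><sz val="10"/><rFont val="Verdana"/></rPr><t>bar</t></r>'
    '<phoneticPr fontId="1" type="noConversion"/></si>'
    '</sst>'
).encode("utf-8")


def read_shared_strings(xlsx):
    """Hypothetical helper: pull xl/sharedStrings.xml out of an .xlsx archive."""
    with zipfile.ZipFile(xlsx) as archive:
        return archive.read("xl/sharedStrings.xml")


def summarize(xml_bytes):
    """Hypothetical helper: compare the declared count with what is in the file."""
    root = ET.fromstring(xml_bytes)
    declared = int(root.get("uniqueCount"))
    items = root.findall("m:si", NS)            # one <si> per shared string
    t_count = len(root.findall(".//m:t", NS))   # one <t> per text run
    strings = []
    for si in items:
        # A plain string has <si><t>; a mixed-format string has <si><r><t> runs.
        runs = si.findall("m:t", NS) + si.findall("m:r/m:t", NS)
        strings.append("".join(t.text or "" for t in runs))
    return declared, len(items), t_count, strings


declared, si_count, t_count, strings = summarize(SHARED_STRINGS)
print(declared, si_count, t_count, strings)
```

      Counting <si> entries agrees with the declared uniqueCount, and joining the runs inside each <si> yields one string per cell value. Counting every <t> does not agree once a cell mixes formats.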
      

            People

              briosa Blake Riosa (Inactive)
              rchang Robert Chang
              Votes: 56
              Watchers: 64