Fix randomly failing test

  1. … 1 more file in changeset.
Fix randomly failing test

  1. … 1 more file in changeset.
Fix test randomly failing depending on OS end-of-line character

Fix test randomly failing depending on OS end-of-line character

COR-356 : fix all javadoc errors raised by doclint (JDK 8)

  1. … 67 more files in changeset.
COR-356 : fix all javadoc errors raised by doclint (JDK 8)

  1. … 67 more files in changeset.
COR-337 : add new test XXE External Entity point to non-existing resource

    • -0
    • +32
    ./TestOpenOfficeDocumentReader.java
COR-338: add XXE unit test - external entity point to non-existing resource

    • -0
    • +32
    ./TestOpenOfficeDocumentReader.java
COR-338: Add XXE unit test - External Entity points to non-existing resource

    • -0
    • +32
    ./TestOpenOfficeDocumentReader.java
COR-337: Fix vulnerabilities related to XML parsing

    • -0
    • +61
    ./TestOpenOfficeDocumentReader.java
  1. … 13 more files in changeset.
COR-338: Fix vulnerabilities related to XML parsing

    • -0
    • +61
    ./TestOpenOfficeDocumentReader.java
  1. … 13 more files in changeset.
COR-338: Fix vulnerabilities related to XML parsing

    • -0
    • +61
    ./TestOpenOfficeDocumentReader.java
  1. … 13 more files in changeset.
COR-338: Fix vulnerabilities relating to XML parsing

Fix description:

* Use Apache poi-ooxml 3.8-eXo01, which:

** Switches from dom4j to JAXP (SAX)

** Adds a new helper class: SAXHelper

* Use SAXHelper instead of SAXParser in eXo Core's XML document parsers

* Upgrade xmlbeans from 2.3 to 2.6 for MSXWordDocumentReader.

Both Apache poi-ooxml 3.8-eXo01 and xmlbeans 2.6 add an XMLReader class that reads the XML document before parsing.

The XMLReader created by SAXHelper is configured to prevent XEE/XXE attacks (entity expansion and external entity) by capping entity expansion and disabling external entities.

    • -0
    • +61
    ./TestOpenOfficeDocumentReader.java
  1. … 13 more files in changeset.
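
The hardening described in the COR-338 message above can be sketched with plain JAXP. This is only an illustration under stated assumptions: the class `HardenedXmlReaderSketch` and its methods are hypothetical names, not the `SAXHelper` shipped in poi-ooxml 3.8-eXo01, and the exact limits enforced by secure processing depend on the JAXP implementation in use.

```java
import java.io.StringReader;
import javax.xml.XMLConstants;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.InputSource;
import org.xml.sax.XMLReader;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical illustration; this is NOT the SAXHelper class from the changeset.
class HardenedXmlReaderSketch {

    /** Creates an XMLReader with entity expansion limited and external entities disabled. */
    static XMLReader newHardenedReader() throws Exception {
        SAXParserFactory factory = SAXParserFactory.newInstance();
        factory.setNamespaceAware(true);
        // Ask the JAXP implementation to enforce its processing limits,
        // which include a cap on entity expansion.
        factory.setFeature(XMLConstants.FEATURE_SECURE_PROCESSING, true);
        // Standard SAX features: do not resolve external general or parameter entities.
        factory.setFeature("http://xml.org/sax/features/external-general-entities", false);
        factory.setFeature("http://xml.org/sax/features/external-parameter-entities", false);
        return factory.newSAXParser().getXMLReader();
    }

    /** Returns the character data of the document, or throws if the parser rejects it. */
    static String extractText(String xml) {
        final StringBuilder text = new StringBuilder();
        try {
            XMLReader reader = newHardenedReader();
            reader.setContentHandler(new DefaultHandler() {
                @Override
                public void characters(char[] ch, int start, int length) {
                    text.append(ch, start, length);
                }
            });
            reader.parse(new InputSource(new StringReader(xml)));
        } catch (Exception e) {
            throw new IllegalStateException("XML rejected or unreadable", e);
        }
        return text.toString();
    }

    public static void main(String[] args) {
        System.out.println(extractText("<doc>hello</doc>"));
    }
}
```

A test in the spirit of the XXE unit tests listed above would feed `extractText` a document whose external entity points to a non-existing resource and check that the target is never dereferenced.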
COR-332: Improved the test testGetContentAsString2

Update testGetContentAsString2

COR-332: Add a unit test to test the limit

  1. … 1 more file in changeset.
COR-329: Add a unit test to test the limit

  1. … 1 more file in changeset.
COR-332: Fixed the issue with the slide order

  1. … 2 more files in changeset.
COR-329: Fixed the issue with the slide order

  1. … 2 more files in changeset.
COR-332: getContentAsText and getProperties of MSXPPTDocumentReader now parse the content with SAX

  1. … 3 more files in changeset.
COR-329: getContentAsText and getProperties of MSXPPTDocumentReader now parse the content with SAX

  1. … 3 more files in changeset.
COR-331: Implement MSPPTXStreamDocumentReader using SAXParser

Problem analysis:

* Apache's POI for MS PPTX files provides only an in-memory model.

In this model, SAXParser is invoked too many times (three times the number of slides), even just to read some metadata.

It is therefore unsuitable for parsing very large files (in terms of slide count).

Fix description:

* Implement a new document reader for PPTX files by reading the stream.

* Read metadata directly from the corresponding file (core.xml) if this file exists.

* Parse and index the text of only a limited number of leading slides.

    • -0
    • +173
    ./TestPPTXStreamDocumentReader.java
  1. … 4 more files in changeset.
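
The streaming approach described in the COR-331 message above can be sketched as follows. This is a minimal sketch under stated assumptions, not the actual `MSPPTXStreamDocumentReader`: the class `PptxStreamSketch`, its methods, and the output format are hypothetical, slide text is assumed to live in elements with local name `t`, and the title is assumed to be the `title` element of `docProps/core.xml`. Because ZIP entries are not guaranteed to appear in slide order, the sketch keys the text by slide number.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.FilterInputStream;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.Map;
import java.util.TreeMap;
import java.util.regex.Matcher;
import java.util.regex.Pattern;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;
import java.util.zip.ZipOutputStream;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical illustration; names and output format are assumptions.
class PptxStreamSketch {
    private static final Pattern SLIDE = Pattern.compile("ppt/slides/slide(\\d+)\\.xml");

    /** Collects the character data of every element whose local name equals wanted. */
    static String textOf(InputStream in, final String wanted) throws Exception {
        final StringBuilder out = new StringBuilder();
        SAXParserFactory factory = SAXParserFactory.newInstance();
        factory.setNamespaceAware(true);
        factory.newSAXParser().parse(new InputSource(in), new DefaultHandler() {
            private boolean inside;
            @Override public void startElement(String uri, String local, String qName, Attributes atts) {
                if (wanted.equals(local)) inside = true;
            }
            @Override public void endElement(String uri, String local, String qName) {
                if (wanted.equals(local)) { inside = false; out.append(' '); }
            }
            @Override public void characters(char[] ch, int start, int length) {
                if (inside) out.append(ch, start, length);
            }
        });
        return out.toString().trim();
    }

    /** Reads the archive entry by entry; only slides numbered up to maxSlides are parsed. */
    static String extract(InputStream pptx, int maxSlides) {
        Map<Integer, String> slides = new TreeMap<>(); // sorted by slide number
        String title = "";
        try (ZipInputStream zip = new ZipInputStream(pptx)) {
            for (ZipEntry e; (e = zip.getNextEntry()) != null; ) {
                // The SAX parser may close its input; shield the shared zip stream.
                InputStream entry = new FilterInputStream(zip) {
                    @Override public void close() { }
                };
                Matcher m = SLIDE.matcher(e.getName());
                if ("docProps/core.xml".equals(e.getName())) {
                    title = textOf(entry, "title");
                } else if (m.matches() && Integer.parseInt(m.group(1)) <= maxSlides) {
                    slides.put(Integer.parseInt(m.group(1)), textOf(entry, "t"));
                }
            }
        } catch (Exception ex) {
            throw new IllegalStateException("Unreadable PPTX stream", ex);
        }
        return title + " | " + String.join(" ", slides.values());
    }

    /** Builds a tiny in-memory archive for the demo; arguments alternate entry name and XML. */
    static byte[] zipOf(String... nameThenXml) {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (ZipOutputStream zip = new ZipOutputStream(bytes)) {
            for (int i = 0; i + 1 < nameThenXml.length; i += 2) {
                zip.putNextEntry(new ZipEntry(nameThenXml[i]));
                zip.write(nameThenXml[i + 1].getBytes(StandardCharsets.UTF_8));
                zip.closeEntry();
            }
        } catch (Exception ex) {
            throw new IllegalStateException(ex);
        }
        return bytes.toByteArray();
    }

    public static void main(String[] args) {
        String ns = "xmlns:a=\"http://schemas.openxmlformats.org/drawingml/2006/main\"";
        byte[] demo = zipOf(
            "ppt/slides/slide2.xml", "<s " + ns + "><a:t>second</a:t></s>",
            "ppt/slides/slide1.xml", "<s " + ns + "><a:t>first</a:t></s>",
            "docProps/core.xml",
            "<cp xmlns:dc=\"http://purl.org/dc/elements/1.1/\"><dc:title>Deck</dc:title></cp>");
        System.out.println(extract(new ByteArrayInputStream(demo), 500));
    }
}
```

Only one slide is held by the parser at a time, so memory use depends on the largest slide rather than on the slide count. A production reader would also harden the parser against external entities.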
COR-329: Streaming parser for MSXPPTDocumentReader

Fix description:

* Implement a streaming model to get the properties and content of Microsoft PowerPoint files (OOXML).

* Index the content of the first 500 slides.

  1. … 4 more files in changeset.
COR-332: Improve the way MS Excel files are parsed and indexed

    • -59
    • +12
    ./TestMSXExcelDocumentReader.java
  1. … 6 more files in changeset.
COR-329 : New implementation of the MSXExcelDocumentReader using the Event User Model approach to have lower memory footprint with big Excel files

    • -57
    • +12
    ./TestMSXExcelDocumentReader.java
  1. … 2 more files in changeset.
COR-329 : New implementation of the MSXExcelDocumentReader using the Event User Model approach to have lower memory footprint with big Excel files

    • -57
    • +12
    ./TestMSXExcelDocumentReader.java
  1. … 2 more files in changeset.
COR-329 : New implementation of the MSExcelDocumentReader using the Event User Model approach to have lower memory footprint with big Excel files

  1. … 1 more file in changeset.
COR-329 : New implementation of the MSExcelDocumentReader using the Event User Model approach to have lower memory footprint with big Excel files

  1. … 1 more file in changeset.
COR-329 : New implementation of the MSExcelDocumentReader using the Event User Model approach to have lower memory footprint with big Excel files

  1. … 1 more file in changeset.
COR-329 : New implementation of the MSExcelDocumentReader and MSXExcelDocumentReader using the Event User Model approach to have lower memory footprint with big Excel files + Improved version of POIPropertiesReader to have a lower memory footprint with XLSX, DOCX and PPTX documents

    • -59
    • +12
    ./TestMSXExcelDocumentReader.java
  1. … 6 more files in changeset.