Clone Tools
  • last updated a few minutes ago
Constraints
Constraints: committers
 
Constraints: files
Constraints: dates
COR-356 : fix all javadoc errors raised by doclint (JDK 8)

    • -1
    • +1
    ./TestMSExcelOnTikaDocumentReader.java
    • -1
    • +1
    ./TestMSOutlookOnTikaDocumentReader.java
    • -1
    • +1
    ./TestMSVisioOnTikaDocumentReader.java
    • -1
    • +1
    ./TestMSXExcelOnTikaDocumentReader.java
    • -1
    • +1
    ./TestMSXWordOnTikaDocumentReader.java
    • -1
    • +1
    ./TestOpenOfficeOnTikaDocumentReader.java
    • -1
    • +1
    ./TestTextPlainOnTikaDocumentReader.java
  1. … 54 more files in changeset.
COR-356 : fix all javadoc errors raised by doclint (JDK 8)

    • -1
    • +1
    ./TestMSExcelOnTikaDocumentReader.java
    • -1
    • +1
    ./TestMSOutlookOnTikaDocumentReader.java
    • -1
    • +1
    ./TestMSVisioOnTikaDocumentReader.java
    • -1
    • +1
    ./TestMSXExcelOnTikaDocumentReader.java
    • -1
    • +1
    ./TestMSXWordOnTikaDocumentReader.java
    • -1
    • +1
    ./TestOpenOfficeOnTikaDocumentReader.java
    • -1
    • +1
    ./TestTextPlainOnTikaDocumentReader.java
  1. … 54 more files in changeset.
COR-354: Upgrade the versions of pdfbox, poi, tika

Fix description:

* Update the versions in the main pom

* Remove InvalidPasswordExcetion in PDDocument.decrypt(). This change exists since pdfbox 1.8.6 (PDFBOX-1474).

* TIKA-1400 (tika 1.10) extracts the header and footer of Excel file (.xls). The information is then put into class "outside".

The output of TestMSExcelOnTikaDocumentReader must be therefore updated.

    • -2
    • +8
    ./TestMSExcelOnTikaDocumentReader.java
  1. … 2 more files in changeset.
COR-334: Fix the test TestPropertiesExtractionOnTika.testPPTDocumentReaderService()

COR-333: TikaDocumentReader causes 'Unparseable date'

    • -6
    • +10
    ./TestMSXExcelOnTikaDocumentReader.java
    • -7
    • +44
    ./TestPropertiesExtractionOnTika.java
  1. … 4 more files in changeset.
COR-333: TikaDocumentReader causes 'Unparseable date'

Fix description:

* Don't convert date value extracted from document's properties to String of Java's Date object. This format doesn't conform to ISO8601 standard used in JCR

    • -6
    • +10
    ./TestMSXExcelOnTikaDocumentReader.java
    • -7
    • +44
    ./TestPropertiesExtractionOnTika.java
  1. … 4 more files in changeset.
COR-333: TikaDocumentReader causes 'Unparseable date'

    • -6
    • +10
    ./TestMSXExcelOnTikaDocumentReader.java
    • -18
    • +41
    ./TestPropertiesExtractionOnTika.java
  1. … 4 more files in changeset.
COR-337: Fix vulnerabilities related to XML parsing

    • -0
    • +79
    ./TestMSXExcelOnTikaDocumentReader.java
    • -0
    • +57
    ./TestMSXPPTOnTikaDocumentReader.java
    • -2
    • +61
    ./TestMSXWordOnTikaDocumentReader.java
    • -0
    • +61
    ./TestOpenOfficeOnTikaDocumentReader.java
    • -1
    • +285
    ./TestPropertiesExtractionOnTika.java
  1. … 14 more files in changeset.
COR-338: Fix vulnerabilities related to XML parsing

    • -0
    • +79
    ./TestMSXExcelOnTikaDocumentReader.java
    • -0
    • +57
    ./TestMSXPPTOnTikaDocumentReader.java
    • -2
    • +61
    ./TestMSXWordOnTikaDocumentReader.java
    • -0
    • +61
    ./TestOpenOfficeOnTikaDocumentReader.java
    • -7
    • +283
    ./TestPropertiesExtractionOnTika.java
  1. … 14 more files in changeset.
COR-338: Fix vulnerabilities related to XML parsing

    • -0
    • +79
    ./TestMSXExcelOnTikaDocumentReader.java
    • -0
    • +57
    ./TestMSXPPTOnTikaDocumentReader.java
    • -2
    • +61
    ./TestMSXWordOnTikaDocumentReader.java
    • -0
    • +61
    ./TestOpenOfficeOnTikaDocumentReader.java
    • -7
    • +283
    ./TestPropertiesExtractionOnTika.java
  1. … 14 more files in changeset.
COR-338: Fix vulnerabilities relating to XML parsing

Fix description:

* Use Apache poi-ooxml 3.8-eXo01 which:

** Switch from dom4j to JAXP (SAX)

** New helper class: SAXHelper

* Use SAXHelper instead of SAXParser in eXo Core's XML Document parsers

* Upgrade xmlbeans from 2.3 to 2.6 for MSXWordDocumentReader.

Both Apache poi-ooxml 3.8-eXo01 and Xmlbeans2.6 add XMLReader classe to read XML document before parsing.

The XMLReader initiated by SAXHelper has the parameters to prevent XEE/XXE attacks by setting maximum expansion entity and disabling external entity.

    • -0
    • +79
    ./TestMSXExcelOnTikaDocumentReader.java
    • -0
    • +57
    ./TestMSXPPTOnTikaDocumentReader.java
    • -2
    • +61
    ./TestMSXWordOnTikaDocumentReader.java
    • -0
    • +61
    ./TestOpenOfficeOnTikaDocumentReader.java
    • -7
    • +283
    ./TestPropertiesExtractionOnTika.java
  1. … 14 more files in changeset.
COR-333: Re-add the todos related to dates

COR-334: Re-add the todos related to dates

COR-334: TikaDocumentReader causes 'Unparseable date'

    • -6
    • +10
    ./TestMSXExcelOnTikaDocumentReader.java
    • -27
    • +50
    ./TestPropertiesExtractionOnTika.java
  1. … 4 more files in changeset.
COR-333: TikaDocumentReader causes 'Unparseable date'

    • -6
    • +10
    ./TestMSXExcelOnTikaDocumentReader.java
    • -27
    • +50
    ./TestPropertiesExtractionOnTika.java
  1. … 4 more files in changeset.
COR-306: Upgrade to Tika 1.4

  1. … 2 more files in changeset.
COR-281 : Can not get properties: Your document contained more than 10240 characters, and so your requested limit has been reached.

    • -0
    • +14
    ./TestPropertiesExtractionOnTika.java
  1. … 6 more files in changeset.
COR-280 : Can not get properties: Your document contained more than 10240 characters, and so your requested limit has been reached.

    • -0
    • +14
    ./TestPropertiesExtractionOnTika.java
  1. … 6 more files in changeset.
COR-280 : Can not get properties: Your document contained more than 10240 characters, and so your requested limit has been reached.

    • -0
    • +14
    ./TestPropertiesExtractionOnTika.java
  1. … 3 more files in changeset.
COR-278 : IllegalArgumentException when upload a vsd file via CE core-2.6.x

    • -0
    • +65
    ./TestMSVisioOnTikaDocumentReader.java
  1. … 4 more files in changeset.
COR-278 : IllegalArgumentException when upload a vsd file via CE

    • -0
    • +64
    ./TestMSVisioOnTikaDocumentReader.java
  1. … 4 more files in changeset.
EXOJCR-1889: logging cleanup

  1. … 24 more files in changeset.
EXOJCR-1864: Fixed incorrect date in tests

  1. … 3 more files in changeset.
EXOJCR-749: TestPropertiesExtractionOnTika dates fixed

    • -10
    • +4
    ./TestPropertiesExtractionOnTika.java
EXOJCR-749: tests fixed

    • -2
    • +2
    ./TestOpenOfficeOnTikaDocumentReader.java
  1. … 3 more files in changeset.
EXOJCR-749: TikaDocumentReader added; tests added

    • -0
    • +81
    ./TestHtmlOnTikaDocumentReader.java
    • -0
    • +106
    ./TestMSExcelOnTikaDocumentReader.java
    • -0
    • +88
    ./TestMSOutlookOnTikaDocumentReader.java
    • -0
    • +60
    ./TestMSWordOnTikaDocumentReader.java
    • -0
    • +110
    ./TestMSXExcelOnTikaDocumentReader.java
    • -0
    • +63
    ./TestMSXPPTOnTikaDocumentReader.java
    • -0
    • +91
    ./TestMSXWordOnTikaDocumentReader.java
    • -0
    • +140
    ./TestMimetypes.java
    • -0
    • +88
    ./TestOpenOfficeOnTikaDocumentReader.java
    • -0
    • +87
    ./TestPDFOnTikaDocumentReader.java
    • -0
    • +93
    ./TestPPTOnTikaDocumentReader.java
    • -0
    • +302
    ./TestPropertiesExtractionOnTika.java
    • -0
    • +129
    ./TestTextPlainOnTikaDocumentReader.java
    • -0
    • +117
    ./TestXMLOnTikaDocumentReader.java
  1. … 25 more files in changeset.