extraction

Checkout Tools
  • last updated a few minutes ago
Constraints
Constraints: committers
 
Constraints: files
Constraints: dates
LUCENE-6917: rename/deprecate numeric classes in favor of dimensional values
  1. … 168 more files in changeset.
PDFBOX-3155: Disable tests that fail with new Java 9 version number scheme
  1. … 2 more files in changeset.
SOLR-8131: fix test solrconfig.xml files for the contrib modules
  1. … 14 more files in changeset.
SOLR-8330: Standardize and fix logger creation and usage so that they aren't shared across source files.
  1. … 501 more files in changeset.
SOLR-8180: jcl-over-slf4j is officially a solrj/solr dependency now; not marked optional in a POM.
  1. … 33 more files in changeset.
SOLR-8302: SolrResourceLoader takes a Path for its instance directory
  1. … 58 more files in changeset.
SOLR-8166: Add some null checks
SOLR-8166: Introduce possibility to configure ParseContext in ExtractingRequestHandler/ExtractingDocumentLoader
    • ?
    ./src/test-files/extraction/pdf-with-image.pdf
  1. … 1 more file in changeset.
LUCENE-6732: Remove tabs in JS and XML files
  1. … 116 more files in changeset.
Remove explicitly defined request handlers from example and test solrconfig's that are already defined implicitly
  1. … 55 more files in changeset.
SOLR-7532: Removed occurrences of the unused 'commitIntervalLowerBound' property for updateHandler elements from Solr configuration
  1. … 10 more files in changeset.
Remove unnecessary svn:executable from some files
  1. … 93 more files in changeset.
LUCENE-6378: Fix all RuntimeExceptions to throw the underlying root cause
  1. … 12 more files in changeset.
SOLR-7317: Remove jhighlight.jar which contains LGPL-only code
  1. … 4 more files in changeset.
SOLR-7139: Fix SolrContentHandler for TIKA to ignore multiple startDocument events
  1. … 1 more file in changeset.
LUCENE-4797: fix remaining html violations, engage linter in solr
  1. … 162 more files in changeset.
LUCENE-6224: cut over more package.htmls

  1. … 177 more files in changeset.
SOLR-6856: fix preceding whitespace for attribute values dumped into the catch-all field.
  1. … 1 more file in changeset.
SOLR-6856: Restore ExtractingRequestHandler's ability to capture all HTML tags when parsing (X)HTML.
  1. … 1 more file in changeset.
SOLR-6991,SOLR-6387: Under Turkish locale, don't run solr-cell and dataimporthandler-extras tests that use Tika
  1. … 1 more file in changeset.
SOLR-7014: Collapse identical catch branches in try-catch statements
  1. … 34 more files in changeset.
SOLR-6996: Add a test for ODF files in ExtractingRequestHandlerTest
    • ?
    ./src/test-files/extraction/open-document.odt
SOLR-6991: Update to Apache TIKA 1.7
  1. … 31 more files in changeset.
SOLR-6826: fieldType capitalization is not consistent with the rest of case-sensitive field names
  1. … 81 more files in changeset.
SOLR-6780: Fixed a bug in how default/appends/invariants params were affecting the set of all keys found in the request parameters, resulting in some key=value param pairs being duplicated.
  1. … 6 more files in changeset.
LUCENE-6007: Regularize ivy.xml files to use configurations that map to remote master configurations, so that Ivy won't try to download extraneous crap
  1. … 46 more files in changeset.
SOLR-6488: Upgrade Solr Cell to TIKA 1.6
  1. … 48 more files in changeset.
LUCENE-5901:Replaced all occurences of LUCENE_CURRENT with LATEST for luceneMatchVersion
  1. … 102 more files in changeset.
Fix test file name typo
SOLR-4385: Stop using SVN Keyword Substitution in Solr src code
  1. … 1888 more files in changeset.