Checkout Tools
  • last updated 5 hours ago
Constraints
Constraints: committers
 
Constraints: files
Constraints: dates
remaining eol-style fixes to trunk, native except .sh (LF)
    • ?
    ./org/apache/lucene/analysis/ga/irish.sbl.txt
  1. … 230 more files in changeset.
fix eol-style
  1. … 196 more files in changeset.
LUCENE-3919: fix czechstemmer aioobe on the empty term
  1. … 71 more files in changeset.
LUCENE-3913: Fix HTMLStripCharFilter invalid final offset for input containing </br>
  1. … 3 more files in changeset.
LUCENE-3883: Irish Analyzer
    • ?
    ./org/tartarus/snowball/ext/IrishStemmer.java
    • ?
    ./org/apache/lucene/analysis/ga/IrishAnalyzer.java
  1. … 14 more files in changeset.
LUCENE-3905: sometimes run real-ish content (from LineFileDocs) through the analyzers too; fix end() offset bugs in the ngram tokenizers/filters
  1. … 9 more files in changeset.
add missing license headers
  1. … 7 more files in changeset.
LUCENE-3898: reset() was missing some state
  1. … 5 more files in changeset.
LUCENE-3894: some tokenizers weren't reading all input chars
  1. … 9 more files in changeset.
LUCENE-3889: remove unnecessary/unused base class
  1. … 4 more files in changeset.
SOLR-2764: Create a NorwegianLightStemmer and NorwegianMinimalStemmer (backport)
  1. … 13 more files in changeset.
basic javadocs improvements, mostly simple descriptions where the class had nothing before
  1. … 57 more files in changeset.
javadocs: add missing package.htmls
    • ?
    ./org/apache/lucene/analysis/util/package.html
    • ?
    ./org/tartarus/snowball/package.html
    • ?
    ./org/tartarus/snowball/ext/package.html
    • ?
    ./org/apache/lucene/analysis/path/package.html
    • ?
    ./org/apache/lucene/analysis/charfilter/package.html
  1. … 27 more files in changeset.
LUCENE-3848: don't produce tokenstreams that start with posinc=0
  1. … 9 more files in changeset.
Kuromoji now generates compounds and the segmentations of these compounds in search mode (backport of LUCENE-3767)
  1. … 32 more files in changeset.
LUCENE-3748: EnglishPossessiveFilter did not work with a proper right quotation mark
  1. … 7 more files in changeset.
LUCENE-3765: Trappy behavior with StopFilter/ignoreCase
  1. … 32 more files in changeset.
SOLR-3097, SOLR-3105: add fieldtypes for different languages to the example
    • ?
    ./org/apache/lucene/analysis/in/IndicTokenizer.java
  1. … 24 more files in changeset.
LUCENE-3742: fix token offset for hangs-off-end output in SynonymFilter
  1. … 7 more files in changeset.
LUCENE-3725: add optional packing to FSTs
  1. … 22 more files in changeset.
LUCENE-3690: Re-implemented HTMLStripCharFilter as a JFlex-generated scanner, and moved from Solr to Lucene Common Analyzers contrib. Fixes LUCENE-2208, SOLR-882, and SOLR-42.
    • ?
    ./org/apache/lucene/analysis/charfilter/htmlentity.py
  1. … 21 more files in changeset.
LUCENE-3717: add tests
  1. … 7 more files in changeset.
LUCENE-3717: fix broken offsets in ngramtokenizers, and check return value of Reader.read
  1. … 8 more files in changeset.
LUCENE-3717: add checkRandomData to more analyzers and fix more offsets bugs
  1. … 23 more files in changeset.
LUCENE-3717: add better offsets testing to BaseTokenStreamTestCase, fix offsets bugs in ThaiWordFilter and ICUTokenizer
  1. … 10 more files in changeset.
SOLR-2891: fix CompoundWordTokenFilter to not create invalid offsets when the length of the text was changed by a previous filter
  1. … 6 more files in changeset.
LUCENE-3695: move some confusing FST sugar out
  1. … 7 more files in changeset.
LUCENE-3305: add Kuromoji Japanese morphological analyzer
  1. … 46 more files in changeset.
SOLR-3020: Add KeywordAttribute support to HunspellStemFilter
  1. … 8 more files in changeset.
LUCENE-3679: replace IR.getFieldNames with IR.getFieldInfos
  1. … 43 more files in changeset.