first stab on fixing #1170 #1346

tarzanek · 2017-01-26T09:08:33Z

#1170

tarzanek · 2017-01-26T09:10:26Z

basically the only two things missing are:

pom.xml integration
position incrementor as in https://issues.apache.org/jira/secure/attachment/12663628/LUCENE-5897.patch - this needs to be tested, junit do some of tests, so let's see if they pass, I will run also helius farm job to get full checks

vladak · 2017-01-26T10:21:14Z

Couple of comments:

turn this on for javascript analyzer
let xref to be generated
fix the Maven somehow
produce some sort of message when the file is skipped

tulinkry · 2017-01-26T12:04:19Z

src/org/opensolaris/opengrok/analysis/plain/PlainSymbolTokenizer.lex

 import org.opensolaris.opengrok.analysis.JFlexTokenizer;
 %%
 %public
 %class PlainSymbolTokenizer


tulinkry · 2017-01-26T12:04:27Z

src/org/opensolaris/opengrok/analysis/plain/PlainFullTokenizer.lex

 %%

 %public
 %class PlainFullTokenizer


tulinkry · 2017-01-26T12:04:36Z

src/org/opensolaris/opengrok/analysis/javascript/JavaScriptSymbolTokenizer.lex


 %%
 %public
 %class JavaScriptSymbolTokenizer


tulinkry · 2017-01-26T12:04:47Z

src/org/opensolaris/opengrok/analysis/java/JavaSymbolTokenizer.lex

 %class JavaSymbolTokenizer
 %extends JFlexTokenizer
 %init{
 super(in);


tulinkry · 2017-01-26T12:05:11Z

opengrok-indexer/pom.xml

 </execution>
 </executions>
 </plugin>
+<!-- TODO add the same fix as is in build.xml to patch jflex generated files to stop increasing buffer beyond token size that lucene accepts


maybe copyright?

tarzanek · 2017-01-27T09:24:23Z

@tulinkry , copyright fixed ... will you have time to add the regexps to pom.xml ?

tarzanek · 2017-01-27T15:26:52Z

let's merge this, let the fun begin :-D

tarzanek · 2017-01-27T15:28:36Z

fwiw -
fix the Maven somehow - let's sort this out in next pull req
produce some sort of message when the file is skipped - file won't be skipped, just the buggy token, which I think is safe to ignore, I am still afraid the index size and search speed will be impacted by this, since we might get 1MB doc fields thanks to this ...

first stab on fixing oracle#1170

ff70ede

tarzanek self-assigned this Jan 26, 2017

tarzanek added this to the 0.13 milestone Jan 26, 2017

fix more analysers per internal scanner, don't touch xrefs

ba14bbd

tulinkry suggested changes Jan 26, 2017

View reviewed changes

copyright fixes

b40c46f

tulinkry approved these changes Jan 27, 2017

View reviewed changes

tarzanek merged commit e4df909 into oracle:master Jan 27, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

first stab on fixing #1170 #1346

first stab on fixing #1170 #1346

Uh oh!

tarzanek commented Jan 26, 2017 •

edited by tulinkry

Loading

tarzanek commented Jan 26, 2017

vladak commented Jan 26, 2017

tulinkry Jan 26, 2017

tulinkry Jan 26, 2017

tulinkry Jan 26, 2017

tulinkry Jan 26, 2017

tulinkry Jan 26, 2017

tarzanek commented Jan 27, 2017

tarzanek commented Jan 27, 2017

tarzanek commented Jan 27, 2017

Labels

3 participants

first stab on fixing #1170 #1346

first stab on fixing #1170 #1346

Uh oh!

Conversation

tarzanek commented Jan 26, 2017 • edited by tulinkry Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

tarzanek commented Jan 26, 2017

vladak commented Jan 26, 2017

tulinkry Jan 26, 2017

Choose a reason for hiding this comment

tulinkry Jan 26, 2017

Choose a reason for hiding this comment

tulinkry Jan 26, 2017

Choose a reason for hiding this comment

tulinkry Jan 26, 2017

Choose a reason for hiding this comment

tulinkry Jan 26, 2017

Choose a reason for hiding this comment

tarzanek commented Jan 27, 2017

tarzanek commented Jan 27, 2017

tarzanek commented Jan 27, 2017

Labels

3 participants

tarzanek commented Jan 26, 2017 •

edited by tulinkry

Loading