List Info

Thread: Created: (LUCENE-794) Beginnings of a span based highlighter




Updated: (LUCENE-794) SpanScorer and SimpleSpanFragmenter for Contrib Highlighter
country flaguser name
United States
2007-06-01 09:42:16
     [ https://issues.apache.org/jira/browse/LUCENE-794?page=com.
atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Mark Miller updated LUCENE-794:
-------------------------------

    Attachment: spanhighlighter7.patch

Minor update to straighten a few things out.

- Replaced custom CachingTokenStream with Lucene's
CachingTokenFilter
- Some refactoring in the SpanExtractor (now
WeightedSpanTermExtractor)
- Updated some stale JavaDoc

> SpanScorer and SimpleSpanFragmenter for Contrib
Highlighter
>
-----------------------------------------------------------
>
>                 Key: LUCENE-794
>                 URL: http
s://issues.apache.org/jira/browse/LUCENE-794
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: Other
>            Reporter: Mark Miller
>            Priority: Minor
>         Attachments: CachedTokenStream.java,
CachedTokenStream.java, CachedTokenStream.java,
DefaultEncoder.java, Encoder.java, Formatter.java,
Highlighter.java, Highlighter.java, Highlighter.java,
Highlighter.java, Highlighter.java, HighlighterTest.java,
HighlighterTest.java, HighlighterTest.java,
HighlighterTest.java, MemoryIndex.java,
QuerySpansExtractor.java, QuerySpansExtractor.java,
QuerySpansExtractor.java, QuerySpansExtractor.java,
SimpleFormatter.java, spanhighlighter.patch,
spanhighlighter2.patch, spanhighlighter3.patch,
spanhighlighter5.patch, spanhighlighter6.patch,
spanhighlighter7.patch, spanhighlighter_patch_4.zip,
SpanHighlighterTest.java, SpanHighlighterTest.java,
SpanScorer.java, SpanScorer.java, WeightedSpanTerm.java
>
>
> This patch adds a new Scorer class (SpanQueryScorer) to
the Highlighter package that scores just like QueryScorer,
but scores a 0 for Terms that did not cause the Query hit.
This gives 'actual' hit highlighting for the range of
SpanQuerys and PhraseQuery. There is also a new Fragmenter
that attempts to fragment without breaking up Spans.
> See http:
//issues.apache.org/jira/browse/LUCENE-403 for some
background.
> There is a dependency on MemoryIndex.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue
online.


------------------------------------------------------------
---------
To unsubscribe, e-mail: java-dev-unsubscribelucene.apache.org
For additional commands, e-mail: java-dev-helplucene.apache.org


Commented: (LUCENE-794) SpanScorer and SimpleSpanFragmenter for Contrib Highlighter
country flaguser name
United States
2007-06-05 15:07:26
    [ https://issues.apache.org/jira/browse/
LUCENE-794?page=com.atlassian.jira.plugin.system.issuetabpan
els:comment-tabpanel#action_12501682 ] 

Mark Miller commented on LUCENE-794:
------------------------------------

I plan on one more release and than I am finished.

I need to optimize the scoring (stop looking at positions
for terms that are not position sensitive)

Make a couple unit tests to check for a bug I suspect

Turn the javadoc's into something I am actually proud of.

I would wait for this final patch before taking a look at
this Mark H.

I apologize for being so incremental on this issue...lesson
learned.

- Mark

> SpanScorer and SimpleSpanFragmenter for Contrib
Highlighter
>
-----------------------------------------------------------
>
>                 Key: LUCENE-794
>                 URL: http
s://issues.apache.org/jira/browse/LUCENE-794
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: Other
>            Reporter: Mark Miller
>            Priority: Minor
>         Attachments: CachedTokenStream.java,
CachedTokenStream.java, CachedTokenStream.java,
DefaultEncoder.java, Encoder.java, Formatter.java,
Highlighter.java, Highlighter.java, Highlighter.java,
Highlighter.java, Highlighter.java, HighlighterTest.java,
HighlighterTest.java, HighlighterTest.java,
HighlighterTest.java, MemoryIndex.java,
QuerySpansExtractor.java, QuerySpansExtractor.java,
QuerySpansExtractor.java, QuerySpansExtractor.java,
SimpleFormatter.java, spanhighlighter.patch,
spanhighlighter2.patch, spanhighlighter3.patch,
spanhighlighter5.patch, spanhighlighter6.patch,
spanhighlighter7.patch, spanhighlighter_patch_4.zip,
SpanHighlighterTest.java, SpanHighlighterTest.java,
SpanScorer.java, SpanScorer.java, WeightedSpanTerm.java
>
>
> This patch adds a new Scorer class (SpanQueryScorer) to
the Highlighter package that scores just like QueryScorer,
but scores a 0 for Terms that did not cause the Query hit.
This gives 'actual' hit highlighting for the range of
SpanQuerys and PhraseQuery. There is also a new Fragmenter
that attempts to fragment without breaking up Spans.
> See http:
//issues.apache.org/jira/browse/LUCENE-403 for some
background.
> There is a dependency on MemoryIndex.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue
online.


------------------------------------------------------------
---------
To unsubscribe, e-mail: java-dev-unsubscribelucene.apache.org
For additional commands, e-mail: java-dev-helplucene.apache.org


Updated: (LUCENE-794) Extend contrib Highlighter to properly support phrase queries and span
country flaguser name
United States
2007-06-27 16:22:26
     [ https://issues.apache.org/jira/browse/LUCENE-794?page=com.
atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Mark Miller updated LUCENE-794:
-------------------------------

    Attachment: spanhighlighter9.patch

patch version 9 : Apply to root dir of trunk 

Various small improvements.

Be sure to use the recently updated CachingTokenFilter for
optimal performance.

> Extend contrib Highlighter to properly support phrase
queries and span queries
>
------------------------------------------------------------
------------------
>
>                 Key: LUCENE-794
>                 URL: http
s://issues.apache.org/jira/browse/LUCENE-794
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: Other
>            Reporter: Mark Miller
>            Priority: Minor
>         Attachments: CachedTokenStream.java,
CachedTokenStream.java, CachedTokenStream.java,
DefaultEncoder.java, Encoder.java, Formatter.java,
Highlighter.java, Highlighter.java, Highlighter.java,
Highlighter.java, Highlighter.java, HighlighterTest.java,
HighlighterTest.java, HighlighterTest.java,
HighlighterTest.java, MemoryIndex.java,
QuerySpansExtractor.java, QuerySpansExtractor.java,
QuerySpansExtractor.java, QuerySpansExtractor.java,
SimpleFormatter.java, spanhighlighter.patch,
spanhighlighter2.patch, spanhighlighter3.patch,
spanhighlighter5.patch, spanhighlighter6.patch,
spanhighlighter7.patch, spanhighlighter8.patch,
spanhighlighter9.patch, spanhighlighter_patch_4.zip,
SpanHighlighterTest.java, SpanHighlighterTest.java,
SpanScorer.java, SpanScorer.java, WeightedSpanTerm.java
>
>
> This patch adds a new Scorer class (SpanQueryScorer) to
the Highlighter package that scores just like QueryScorer,
but scores a 0 for Terms that did not cause the Query hit.
This gives 'actual' hit highlighting for the range of
SpanQuerys and PhraseQuery. There is also a new Fragmenter
that attempts to fragment without breaking up Spans.
> See http:
//issues.apache.org/jira/browse/LUCENE-403 for some
background.
> There is a dependency on MemoryIndex.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue
online.


------------------------------------------------------------
---------
To unsubscribe, e-mail: java-dev-unsubscribelucene.apache.org
For additional commands, e-mail: java-dev-helplucene.apache.org


Commented: (LUCENE-794) Extend contrib Highlighter to properly support phrase queries and spa
country flaguser name
United States
2007-07-26 18:39:03
    [ https://issues.apache.org/jira/browse/
LUCENE-794?page=com.atlassian.jira.plugin.system.issuetabpan
els:comment-tabpanel#action_12515911 ] 

Otis Gospodnetic commented on LUCENE-794:
-----------------------------------------

Mark, wow, long list of files up there.  I can't tell which
ones are still relevant.  Ah, only  spanhighlighter9.patch,
right?

It looks like all files in that patch are new files, that
is, this is a parallel highlighter implementation - we can
leave the old one in there and commit yours without worrying
about breaking the old one.  Could you add Apache license
headers to all files, switch to 2 spaces for indentation,
and then I think this can get committed?

Oh, and since contrib can be java 1.5+, I think you can use
StringBuilder instead of StringBuffer, etc.


> Extend contrib Highlighter to properly support phrase
queries and span queries
>
------------------------------------------------------------
------------------
>
>                 Key: LUCENE-794
>                 URL: http
s://issues.apache.org/jira/browse/LUCENE-794
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: Other
>            Reporter: Mark Miller
>            Priority: Minor
>         Attachments: CachedTokenStream.java,
CachedTokenStream.java, CachedTokenStream.java,
DefaultEncoder.java, Encoder.java, Formatter.java,
Highlighter.java, Highlighter.java, Highlighter.java,
Highlighter.java, Highlighter.java, HighlighterTest.java,
HighlighterTest.java, HighlighterTest.java,
HighlighterTest.java, MemoryIndex.java,
QuerySpansExtractor.java, QuerySpansExtractor.java,
QuerySpansExtractor.java, QuerySpansExtractor.java,
SimpleFormatter.java, spanhighlighter.patch,
spanhighlighter2.patch, spanhighlighter3.patch,
spanhighlighter5.patch, spanhighlighter6.patch,
spanhighlighter7.patch, spanhighlighter8.patch,
spanhighlighter9.patch, spanhighlighter_patch_4.zip,
SpanHighlighterTest.java, SpanHighlighterTest.java,
SpanScorer.java, SpanScorer.java, WeightedSpanTerm.java
>
>
> This patch adds a new Scorer class (SpanQueryScorer) to
the Highlighter package that scores just like QueryScorer,
but scores a 0 for Terms that did not cause the Query hit.
This gives 'actual' hit highlighting for the range of
SpanQuerys and PhraseQuery. There is also a new Fragmenter
that attempts to fragment without breaking up Spans.
> See http:
//issues.apache.org/jira/browse/LUCENE-403 for some
background.
> There is a dependency on MemoryIndex.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue
online.


------------------------------------------------------------
---------
To unsubscribe, e-mail: java-dev-unsubscribelucene.apache.org
For additional commands, e-mail: java-dev-helplucene.apache.org


Updated: (LUCENE-794) Extend contrib Highlighter to properly support phrase queries and span
country flaguser name
United States
2007-07-26 20:35:04
     [ https://issues.apache.org/jira/browse/LUCENE-794?page=com.
atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Mark Miller updated LUCENE-794:
-------------------------------

    Attachment: spanhighlighter10.patch

Requested changes have been made. Only relevant file now is
spanhighlighter10.patch.

This is a parallel implementation...it uses all of the
current Highlighter classes. Really, it is just a new Scorer
implementation that scores position sensitive queries based
on correct positions for a hit.

The whole approach was radically changed from the
StringBuilder version, so all code is still Java 1.4
compatible.

I have been using this extensively with great success for a
few months now.

> Extend contrib Highlighter to properly support phrase
queries and span queries
>
------------------------------------------------------------
------------------
>
>                 Key: LUCENE-794
>                 URL: http
s://issues.apache.org/jira/browse/LUCENE-794
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: Other
>            Reporter: Mark Miller
>            Priority: Minor
>         Attachments: CachedTokenStream.java,
CachedTokenStream.java, CachedTokenStream.java,
DefaultEncoder.java, Encoder.java, Formatter.java,
Highlighter.java, Highlighter.java, Highlighter.java,
Highlighter.java, Highlighter.java, HighlighterTest.java,
HighlighterTest.java, HighlighterTest.java,
HighlighterTest.java, MemoryIndex.java,
QuerySpansExtractor.java, QuerySpansExtractor.java,
QuerySpansExtractor.java, QuerySpansExtractor.java,
SimpleFormatter.java, spanhighlighter.patch,
spanhighlighter10.patch, spanhighlighter2.patch,
spanhighlighter3.patch, spanhighlighter5.patch,
spanhighlighter6.patch, spanhighlighter7.patch,
spanhighlighter8.patch, spanhighlighter9.patch,
spanhighlighter_patch_4.zip, SpanHighlighterTest.java,
SpanHighlighterTest.java, SpanScorer.java, SpanScorer.java,
WeightedSpanTerm.java
>
>
> This patch adds a new Scorer class (SpanQueryScorer) to
the Highlighter package that scores just like QueryScorer,
but scores a 0 for Terms that did not cause the Query hit.
This gives 'actual' hit highlighting for the range of
SpanQuerys and PhraseQuery. There is also a new Fragmenter
that attempts to fragment without breaking up Spans.
> See http:
//issues.apache.org/jira/browse/LUCENE-403 for some
background.
> There is a dependency on MemoryIndex.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue
online.


------------------------------------------------------------
---------
To unsubscribe, e-mail: java-dev-unsubscribelucene.apache.org
For additional commands, e-mail: java-dev-helplucene.apache.org


Updated: (LUCENE-794) Extend contrib Highlighter to properly support phrase queries and span
country flaguser name
United States
2007-09-13 10:53:33
     [ https://issues.apache.org/jira/browse/LUCENE-794?page=com.
atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Mark Miller updated LUCENE-794:
-------------------------------

    Attachment: spanhighlighter11.patch

Thanks a lot Andy. As I suspected, the issue is that the
conversion from PhraseQuery to SpanQuery is inexact. I have
updated the code to handle this case though. If a
PhraseQuery has 0 slop then the created Span query will now
force an inorder match. This should be a nice improvement to
the PhraseQuery to SpanQuery approximation.

Patch with fix and new junit test attached.

patch 11

- Mark

> Extend contrib Highlighter to properly support phrase
queries and span queries
>
------------------------------------------------------------
------------------
>
>                 Key: LUCENE-794
>                 URL: http
s://issues.apache.org/jira/browse/LUCENE-794
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: Other
>            Reporter: Mark Miller
>            Priority: Minor
>         Attachments: CachedTokenStream.java,
CachedTokenStream.java, CachedTokenStream.java,
DefaultEncoder.java, Encoder.java, Formatter.java,
Highlighter.java, Highlighter.java, Highlighter.java,
Highlighter.java, Highlighter.java, HighlighterTest.java,
HighlighterTest.java, HighlighterTest.java,
HighlighterTest.java, MemoryIndex.java,
QuerySpansExtractor.java, QuerySpansExtractor.java,
QuerySpansExtractor.java, QuerySpansExtractor.java,
SimpleFormatter.java, spanhighlighter.patch,
spanhighlighter10.patch, spanhighlighter11.patch,
spanhighlighter2.patch, spanhighlighter3.patch,
spanhighlighter5.patch, spanhighlighter6.patch,
spanhighlighter7.patch, spanhighlighter8.patch,
spanhighlighter9.patch, spanhighlighter_patch_4.zip,
SpanHighlighterTest.java, SpanHighlighterTest.java,
SpanScorer.java, SpanScorer.java, WeightedSpanTerm.java
>
>
> This patch adds a new Scorer class (SpanQueryScorer) to
the Highlighter package that scores just like QueryScorer,
but scores a 0 for Terms that did not cause the Query hit.
This gives 'actual' hit highlighting for the range of
SpanQuerys and PhraseQuery. There is also a new Fragmenter
that attempts to fragment without breaking up Spans.
> See http:
//issues.apache.org/jira/browse/LUCENE-403 for some
background.
> There is a dependency on MemoryIndex.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue
online.


------------------------------------------------------------
---------
To unsubscribe, e-mail: java-dev-unsubscribelucene.apache.org
For additional commands, e-mail: java-dev-helplucene.apache.org


Updated: (LUCENE-794) Extend contrib Highlighter to properly support phrase queries and span
country flaguser name
United States
2007-10-19 09:48:52
     [ https://issues.apache.org/jira/browse/LUCENE-794?page=com.
atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Mark Miller updated LUCENE-794:
-------------------------------

    Attachment:     (was: CachedTokenStream.java)

> Extend contrib Highlighter to properly support phrase
queries and span queries
>
------------------------------------------------------------
------------------
>
>                 Key: LUCENE-794
>                 URL: http
s://issues.apache.org/jira/browse/LUCENE-794
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: Other
>            Reporter: Mark Miller
>            Priority: Minor
>         Attachments: Encoder.java, Formatter.java,
Highlighter.java, Highlighter.java, Highlighter.java,
Highlighter.java, Highlighter.java, HighlighterTest.java,
HighlighterTest.java, HighlighterTest.java,
HighlighterTest.java, MemoryIndex.java,
QuerySpansExtractor.java, QuerySpansExtractor.java,
QuerySpansExtractor.java, QuerySpansExtractor.java,
SimpleFormatter.java, spanhighlighter.patch,
spanhighlighter10.patch, spanhighlighter11.patch,
spanhighlighter12.patch, spanhighlighter2.patch,
spanhighlighter3.patch, spanhighlighter5.patch,
spanhighlighter6.patch, spanhighlighter7.patch,
spanhighlighter8.patch, spanhighlighter9.patch,
spanhighlighter_patch_4.zip, SpanHighlighterTest.java,
SpanHighlighterTest.java, SpanScorer.java, SpanScorer.java,
WeightedSpanTerm.java
>
>
> This patch adds a new Scorer class (SpanQueryScorer) to
the Highlighter package that scores just like QueryScorer,
but scores a 0 for Terms that did not cause the Query hit.
This gives 'actual' hit highlighting for the range of
SpanQuerys and PhraseQuery. There is also a new Fragmenter
that attempts to fragment without breaking up Spans.
> See http:
//issues.apache.org/jira/browse/LUCENE-403 for some
background.
> There is a dependency on MemoryIndex.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue
online.


------------------------------------------------------------
---------
To unsubscribe, e-mail: java-dev-unsubscribelucene.apache.org
For additional commands, e-mail: java-dev-helplucene.apache.org


Updated: (LUCENE-794) Extend contrib Highlighter to properly support phrase queries and span
country flaguser name
United States
2007-10-19 09:48:53
     [ https://issues.apache.org/jira/browse/LUCENE-794?page=com.
atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Mark Miller updated LUCENE-794:
-------------------------------

    Attachment:     (was: CachedTokenStream.java)

> Extend contrib Highlighter to properly support phrase
queries and span queries
>
------------------------------------------------------------
------------------
>
>                 Key: LUCENE-794
>                 URL: http
s://issues.apache.org/jira/browse/LUCENE-794
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: Other
>            Reporter: Mark Miller
>            Priority: Minor
>         Attachments: Encoder.java, Formatter.java,
Highlighter.java, Highlighter.java, Highlighter.java,
Highlighter.java, Highlighter.java, HighlighterTest.java,
HighlighterTest.java, HighlighterTest.java,
HighlighterTest.java, MemoryIndex.java,
QuerySpansExtractor.java, QuerySpansExtractor.java,
QuerySpansExtractor.java, QuerySpansExtractor.java,
SimpleFormatter.java, spanhighlighter.patch,
spanhighlighter10.patch, spanhighlighter11.patch,
spanhighlighter12.patch, spanhighlighter2.patch,
spanhighlighter3.patch, spanhighlighter5.patch,
spanhighlighter6.patch, spanhighlighter7.patch,
spanhighlighter8.patch, spanhighlighter9.patch,
spanhighlighter_patch_4.zip, SpanHighlighterTest.java,
SpanHighlighterTest.java, SpanScorer.java, SpanScorer.java,
WeightedSpanTerm.java
>
>
> This patch adds a new Scorer class (SpanQueryScorer) to
the Highlighter package that scores just like QueryScorer,
but scores a 0 for Terms that did not cause the Query hit.
This gives 'actual' hit highlighting for the range of
SpanQuerys and PhraseQuery. There is also a new Fragmenter
that attempts to fragment without breaking up Spans.
> See http:
//issues.apache.org/jira/browse/LUCENE-403 for some
background.
> There is a dependency on MemoryIndex.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue
online.


------------------------------------------------------------
---------
To unsubscribe, e-mail: java-dev-unsubscribelucene.apache.org
For additional commands, e-mail: java-dev-helplucene.apache.org


Updated: (LUCENE-794) Extend contrib Highlighter to properly support phrase queries and span
country flaguser name
United States
2007-10-19 09:48:54
     [ https://issues.apache.org/jira/browse/LUCENE-794?page=com.
atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Mark Miller updated LUCENE-794:
-------------------------------

    Attachment:     (was: CachedTokenStream.java)

> Extend contrib Highlighter to properly support phrase
queries and span queries
>
------------------------------------------------------------
------------------
>
>                 Key: LUCENE-794
>                 URL: http
s://issues.apache.org/jira/browse/LUCENE-794
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: Other
>            Reporter: Mark Miller
>            Priority: Minor
>         Attachments: Encoder.java, Formatter.java,
Highlighter.java, Highlighter.java, Highlighter.java,
Highlighter.java, Highlighter.java, HighlighterTest.java,
HighlighterTest.java, HighlighterTest.java,
HighlighterTest.java, MemoryIndex.java,
QuerySpansExtractor.java, QuerySpansExtractor.java,
QuerySpansExtractor.java, QuerySpansExtractor.java,
SimpleFormatter.java, spanhighlighter.patch,
spanhighlighter10.patch, spanhighlighter11.patch,
spanhighlighter12.patch, spanhighlighter2.patch,
spanhighlighter3.patch, spanhighlighter5.patch,
spanhighlighter6.patch, spanhighlighter7.patch,
spanhighlighter8.patch, spanhighlighter9.patch,
spanhighlighter_patch_4.zip, SpanHighlighterTest.java,
SpanHighlighterTest.java, SpanScorer.java, SpanScorer.java,
WeightedSpanTerm.java
>
>
> This patch adds a new Scorer class (SpanQueryScorer) to
the Highlighter package that scores just like QueryScorer,
but scores a 0 for Terms that did not cause the Query hit.
This gives 'actual' hit highlighting for the range of
SpanQuerys and PhraseQuery. There is also a new Fragmenter
that attempts to fragment without breaking up Spans.
> See http:
//issues.apache.org/jira/browse/LUCENE-403 for some
background.
> There is a dependency on MemoryIndex.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue
online.


------------------------------------------------------------
---------
To unsubscribe, e-mail: java-dev-unsubscribelucene.apache.org
For additional commands, e-mail: java-dev-helplucene.apache.org


Updated: (LUCENE-794) Extend contrib Highlighter to properly support phrase queries and span
country flaguser name
United States
2007-10-19 09:48:55
     [ https://issues.apache.org/jira/browse/LUCENE-794?page=com.
atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Mark Miller updated LUCENE-794:
-------------------------------

    Attachment:     (was: Encoder.java)

> Extend contrib Highlighter to properly support phrase
queries and span queries
>
------------------------------------------------------------
------------------
>
>                 Key: LUCENE-794
>                 URL: http
s://issues.apache.org/jira/browse/LUCENE-794
>             Project: Lucene - Java
>          Issue Type: Improvement
>          Components: Other
>            Reporter: Mark Miller
>            Priority: Minor
>         Attachments: Formatter.java, Highlighter.java,
Highlighter.java, Highlighter.java, Highlighter.java,
Highlighter.java, HighlighterTest.java,
HighlighterTest.java, HighlighterTest.java,
HighlighterTest.java, MemoryIndex.java,
QuerySpansExtractor.java, QuerySpansExtractor.java,
QuerySpansExtractor.java, QuerySpansExtractor.java,
SimpleFormatter.java, spanhighlighter.patch,
spanhighlighter10.patch, spanhighlighter11.patch,
spanhighlighter12.patch, spanhighlighter2.patch,
spanhighlighter3.patch, spanhighlighter5.patch,
spanhighlighter6.patch, spanhighlighter7.patch,
spanhighlighter8.patch, spanhighlighter9.patch,
spanhighlighter_patch_4.zip, SpanHighlighterTest.java,
SpanHighlighterTest.java, SpanScorer.java, SpanScorer.java,
WeightedSpanTerm.java
>
>
> This patch adds a new Scorer class (SpanQueryScorer) to
the Highlighter package that scores just like QueryScorer,
but scores a 0 for Terms that did not cause the Query hit.
This gives 'actual' hit highlighting for the range of
SpanQuerys and PhraseQuery. There is also a new Fragmenter
that attempts to fragment without breaking up Spans.
> See http:
//issues.apache.org/jira/browse/LUCENE-403 for some
background.
> There is a dependency on MemoryIndex.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue
online.


------------------------------------------------------------
---------
To unsubscribe, e-mail: java-dev-unsubscribelucene.apache.org
For additional commands, e-mail: java-dev-helplucene.apache.org


[1-10] [11-20] [21-30] [31-40] [41-50] [51-60] [61-70] [71-80] [81-83]

about | contact  Other archives ( Real Estate discussion Medical topics )