#include <weight.h>
Inheritance diagram for Xapian::Weight:
Public Member Functions | |
virtual | ~Weight () |
virtual std::string | name () const=0 |
virtual std::string | serialise () const=0 |
virtual Weight * | unserialise (const std::string &s) const=0 |
virtual Xapian::weight | get_sumpart (Xapian::termcount wdf, Xapian::termcount doclen) const =0 |
virtual Xapian::weight | get_maxpart () const=0 |
virtual Xapian::weight | get_sumextra (Xapian::termcount doclen) const =0 |
virtual Xapian::weight | get_maxextra () const=0 |
Protected Types | |
enum | stat_flags { COLLECTION_SIZE = 1, RSET_SIZE = 2, AVERAGE_LENGTH = 4, TERMFREQ = 8, RELTERMFREQ = 16, QUERY_LENGTH = 32, WQF = 64, WDF = 128, DOC_LENGTH = 256, DOC_LENGTH_MIN = 512, DOC_LENGTH_MAX = 1024, WDF_MAX = 2048 } |
Stats which the weighting scheme can use (see need_stat()). More... | |
Protected Member Functions | |
void | need_stat (stat_flags flag) |
virtual void | init (double factor)=0 |
Weight (const Weight &) | |
Only allow subclasses to copy us. | |
Weight () | |
Default constructor, needed by subclass constructors. | |
Xapian::doccount | get_collection_size () const |
The number of documents in the collection. | |
Xapian::doccount | get_rset_size () const |
The number of documents marked as relevant. | |
Xapian::doclength | get_average_length () const |
The average length of a document in the collection. | |
Xapian::doccount | get_termfreq () const |
The number of documents which this term indexes. | |
Xapian::doccount | get_reltermfreq () const |
The number of relevant documents which this term indexes. | |
Xapian::termcount | get_query_length () const |
The length of the query. | |
Xapian::termcount | get_wqf () const |
The within-query-frequency of this term. | |
Xapian::termcount | get_doclength_upper_bound () const |
Xapian::termcount | get_doclength_lower_bound () const |
Xapian::termcount | get_wdf_upper_bound () const |
enum Xapian::Weight::stat_flags [protected] |
Stats which the weighting scheme can use (see need_stat()).
virtual Xapian::Weight::~Weight | ( | ) | [virtual] |
Virtual destructor, because we have virtual methods.
Xapian::Weight::Weight | ( | const Weight & | ) | [protected] |
Only allow subclasses to copy us.
Xapian::Weight::Weight | ( | ) | [inline, protected] |
Default constructor, needed by subclass constructors.
void Xapian::Weight::need_stat | ( | stat_flags | flag | ) | [inline, protected] |
Tell Xapian that your subclass will want a particular statistic.
Some of the statistics can be costly to fetch or calculate, so Xapian needs to know which are actually going to be used. You should call need_stat() from your constructor for each such statistic.
flag | The stat_flags value for a required statistic. |
virtual void Xapian::Weight::init | ( | double | factor | ) | [protected, pure virtual] |
Allow the subclass to perform any initialisation it needs to.
factor | Any scaling factor (e.g. from OP_SCALE_WEIGHT). |
virtual std::string Xapian::Weight::name | ( | ) | const [pure virtual] |
Return the name of this weighting scheme.
This name is used by the remote backend. It is passed with the serialised parameters to the remote server so that it knows which class to create.
Return the full namespace-qualified name of your class here - if your class is called FooWeight, return "FooWeight" from this method (Xapian::BM25Weight returns "Xapian::BM25Weight" here).
If you don't want to support the remote backend in your weighting scheme, you can just implement this to throw Xapian::UnimplementedError.
Implemented in Xapian::BoolWeight, Xapian::BM25Weight, and Xapian::TradWeight.
virtual std::string Xapian::Weight::serialise | ( | ) | const [pure virtual] |
Return this object's parameters serialised as a single string.
If you don't want to support the remote backend in your weighting scheme, you can just implement this to throw Xapian::UnimplementedError.
Implemented in Xapian::BoolWeight, Xapian::BM25Weight, and Xapian::TradWeight.
virtual Weight* Xapian::Weight::unserialise | ( | const std::string & | s | ) | const [pure virtual] |
Unserialise parameters.
This method unserialises parameters serialised by the serialise() method and allocates and returns a new object initialised with them.
If you don't want to support the remote backend in your weighting scheme, you can just implement this to throw Xapian::UnimplementedError.
Note that the returned object will be deallocated by Xapian after use with "delete". It must therefore have been allocated with "new".
Implemented in Xapian::BoolWeight, Xapian::BM25Weight, and Xapian::TradWeight.
virtual Xapian::weight Xapian::Weight::get_sumpart | ( | Xapian::termcount | wdf, | |
Xapian::termcount | doclen | |||
) | const [pure virtual] |
Calculate the weight contribution for this object's term to a document.
The parameters give information about the document which may be used in the calculations:
wdf | The within document frequency of the term in the document. | |
doclen | The document's length (unnormalised). |
Implemented in Xapian::BoolWeight, Xapian::BM25Weight, and Xapian::TradWeight.
virtual Xapian::weight Xapian::Weight::get_maxpart | ( | ) | const [pure virtual] |
Return an upper bound on what get_sumpart() can return for any document.
This information is used by the matcher to perform various optimisations, so strive to make the bound as tight as possible.
Implemented in Xapian::BoolWeight, Xapian::BM25Weight, and Xapian::TradWeight.
virtual Xapian::weight Xapian::Weight::get_sumextra | ( | Xapian::termcount | doclen | ) | const [pure virtual] |
Calculate the term-independent weight component for a document.
The parameter gives information about the document which may be used in the calculations:
doclen | The document's length (unnormalised). |
Implemented in Xapian::BoolWeight, Xapian::BM25Weight, and Xapian::TradWeight.
virtual Xapian::weight Xapian::Weight::get_maxextra | ( | ) | const [pure virtual] |
Return an upper bound on what get_sumextra() can return for any document.
This information is used by the matcher to perform various optimisations, so strive to make the bound as tight as possible.
Implemented in Xapian::BoolWeight, Xapian::BM25Weight, and Xapian::TradWeight.
Xapian::doccount Xapian::Weight::get_collection_size | ( | ) | const [inline, protected] |
The number of documents in the collection.
Xapian::doccount Xapian::Weight::get_rset_size | ( | ) | const [inline, protected] |
The number of documents marked as relevant.
Xapian::doclength Xapian::Weight::get_average_length | ( | ) | const [inline, protected] |
The average length of a document in the collection.
Xapian::doccount Xapian::Weight::get_termfreq | ( | ) | const [inline, protected] |
The number of documents which this term indexes.
Xapian::doccount Xapian::Weight::get_reltermfreq | ( | ) | const [inline, protected] |
The number of relevant documents which this term indexes.
Xapian::termcount Xapian::Weight::get_query_length | ( | ) | const [inline, protected] |
The length of the query.
Xapian::termcount Xapian::Weight::get_wqf | ( | ) | const [inline, protected] |
The within-query-frequency of this term.
Xapian::termcount Xapian::Weight::get_doclength_upper_bound | ( | ) | const [inline, protected] |
An lower bound on the maximum length of any document in the database.
This should only be used by get_maxpart() and get_maxextra().
Xapian::termcount Xapian::Weight::get_doclength_lower_bound | ( | ) | const [inline, protected] |
An upper bound on the maximum length of any document in the database.
This should only be used by get_maxpart() and get_maxextra().
Xapian::termcount Xapian::Weight::get_wdf_upper_bound | ( | ) | const [inline, protected] |
An upper bound on the wdf of this term.
This should only be used by get_maxpart() and get_maxextra().