Terms and frequencies?
After running a query and getting the results back, is it possible to get the terms and frequencies for each document?
Thanks,
Eric
After running a query and getting the results back, is it possible to get the terms and frequencies for each document?
Thanks,
Eric
I may use the word-docNo as the HashTable Key,
And count the tf
I think that you are doing some IR jobs..
It describes all of the columns that are defined in WDS 2.6.5. Note that there is a column called "contents", but it is not retrievable.
I see no definition for any column that contains the list of indexed terms for the document. What I am asking is: "Is there a way to retrieve a list of indexed terms for the documents that are returned as a part of a query?"
I don't want to have to go open each file and essentially duplicate the work that the indexing engine has already accomplished.
Eric
Hello E Robinson,
There currently isn't a way to retrieve this information. You can query on the collumn Characterization (System.Search.AutoSummary in 3.0), but that will only give you the terms from the beginning of the document.
If you want access to all of the content you can instantiate and invoke the IFilter to grab the content stream again.
Paul Nystrom - MSFT