Datafari Help

Basic Search

Basic Search allows you to specify a query that will be searching into the title, path and content of the indexed documents. Words specified in the query are searched altogether. To search for terms individually, use the advanced search or a boolean operator (see below). Basic Search is done through the search bar that is found on the home page and search page of Datafari.

An autocompletion mechanism will suggest the terms available in the index, proposing terms that start with the letters that you have already typed to help you complete you query

Basic Search supports the usage of boolean operators such as AND or OR to craft more refined queries. More on that later in this help page.

If you type in something incorrectly, or if a search query generates very few results, Datafari will suggest another query that may generate more results.

Advanced Search

For a desired query, Advanced Search allows you to specify in which parts of the indexed documents you want to search the terms. You can add more criteria by pressing the "+" button at the bottom of the Advanced Search page. Among the available options, the most notables are:

Source:
The source from which the document has been indexed. This is dependent on the configuration that has been used in the indexation jobs.
title:
Allow to search in the titles of the documents
url:
Allows to search in the path of the files. Might be useful as an aditional criteria when you know the file should be in a particular subtree.
content_en / content_fr:
Allows to search in the content of the file, either french or english specifically.
Extension :
Allows to filter search results to a specific extension (doc, pdf, msg, ...).
File size :
Allows to specify an interval for the size of the files.

Other options exist and you can find them in the "select a field" dropdown in the advanced search.

Using operators :

You can craft more complex queries using boolean and unary operators. Below is a description of the most common ones :

AND (ou &&): Requires that the terms on both sides of the operator are presents in a document for it to be part of the results.
This is the default operator in any search you make. It can thus be ommited. "Solar energy" is the same as "solar AND energy" for exemple.

OR : only one of the terms around the operator is needed for a document to be part of the results.

NOT (or -): the term following this operator must NOT be present in the document for the document to be part of the results.

+: the term following this operator MUST be part of the document for the document to be part of the results.

The wildcard * is also allowed, which triggers a search for any word that starts with the same characters as the ones declared in the query.
For instance, searching ret* will return documents containing words such as return, retired, retreat.

Examples and more operators are presented here:
https://lucene.apache.org/solr/guide/7_6/the-standard-query-parser.html#the-standard-query-parser

Facets:

On the left of the results page, you have a set of filter options called facets. When you click on one of the options, the results are automatically refreshed to match your selection, applying a filter to the current search.

For example, when clicking the "doc" checkbox in the "extension" facet, only .doc documents will be returned as results. This allows you to refine your search and obtain more relevant results.