full-text-search
How to evaluate hosted full text search solutions?
Websolr provides a cloud-based Solr with a control panel. It’s in private beta as of this writing, but you can get the service through Heroku. Another hosted Solr service is PowCloud, also in private beta, which seems to offer strong WordPress integration. SolrHQ: another beta service providing a hosted Solr solution, with Joomla and WordPress … Read more
How can I manipulate MySQL fulltext search relevance to make one field more ‘valuable’ than another?
Create three full text indexes a) one on the keyword column b) one on the content column c) one on both keyword and content column Then, your query: SELECT id, keyword, content, MATCH (keyword) AGAINST (‘watermelon’) AS rel1, MATCH (content) AGAINST (‘watermelon’) AS rel2 FROM table WHERE MATCH (keyword,content) AGAINST (‘watermelon’) ORDER BY (rel1*1.5)+(rel2) DESC … Read more
PostgreSQL: Full Text Search – How to search partial words?
Try, SELECT title FROM movies WHERE to_tsvector(title) @@ to_tsquery(‘squire:*’) This works on PostgreSQL 8.4+
Performance of like ‘%Query%’ vs full text search CONTAINS query
Full Text Searching (using the CONTAINS) will be faster/more efficient than using LIKE with wildcarding. Full Text Searching (FTS) includes the ability to define Full Text Indexes, which FTS can use. I don’t know why you wouldn’t define a FTS index if you intended to use the functionality. LIKE with wildcarding on the left side … Read more
Searching for Text within Oracle Stored Procedures
SELECT * FROM ALL_source WHERE UPPER(text) LIKE ‘%BLAH%’ EDIT Adding additional info: SELECT * FROM DBA_source WHERE UPPER(text) LIKE ‘%BLAH%’ The difference is dba_source will have the text of all stored objects. All_source will have the text of all stored objects accessible by the user performing the query. Oracle Database Reference 11g Release 2 (11.2) … Read more
Is there a pure Python Lucene?
Whoosh is a new project which is similar to lucene, but is pure python.
PostgreSQL Full Text Search and Trigram Confusion
They serve very different purposes. Full Text Search is used to return documents that match a search query of stemmed words. Trigrams give you a method for comparing two strings and determining how similar they look. Consider the following examples: SELECT ‘cat’ % ‘cats’; –true The above returns true because ‘cat’ is quite similar to … Read more
Best practices for searchable archive of thousands of documents (pdf and/or xml)
In summary: I’m going to be recommending ElasticSearch, but let’s break the problem down and talk about how to implement it: There are a few parts to this: Extracting the text from your docs to make them indexable Making this text available as full text search Returning highlighted snippets of the doc Knowing where in … Read more
How to search multiple columns in MySQL?
If it is just for searching then you may be able to use CONCATENATE_WS. This would allow wild card searching. There may be performance issues depending on the size of the table. SELECT * FROM pages WHERE CONCAT_WS(”, column1, column2, column3) LIKE ‘%keyword%’