Skip to content

sarnobat/full_text_indexers

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 

Repository files navigation

full_text_indexers

Swish++

  
  find  -iname "*.htm*" | tee /tmp/html_files.txt
  cat /tmp/html_files.txt | index++ --index-file /tmp/swish++.index --verbosity 4 --pattern 'html:*.htm*' -e 'text:*.txt' -
  cat du_files.txt  | perl -pe 's{^.*?\s+}{}g' | grep '.htm' | index++ --index-file /tmp/swish++.index --verbosity 4 --pattern 'html:*.htm*' -e 'text:*.txt' --temp-dir /media/sarnobat/ebay/trash/ -
  cd web
  search++ --max-results 1000 -i /media/sarnobat/ebay/swish++.index mysearchterm
  cd web
  search++ --dump-index

Recoll

Problem: not as attractive output format as tracker

find  -iname "*.htm*" | tee /tmp/html_files.txt
cat /tmp/html_files.txt | RECOLL_TMPDIR=/media/sarnobat/cache_recoll recollindex -if -c /media/sarnobat/cache_recoll/recoll_index/
ls ~/.recoll

recollq -c /media/sarnobat/cache_recoll/recoll_index/  'atletico -myexcludedterm mime:text/html'

Tracker

Problem: can't index individual files, only directories it seems.

  sudo apt install -y tracker
  tracker3 index
  tracker3 index --add videos/
  tracker3 search atletico

Others

About

No description or website provided.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published