Skip to content

Latest commit

 

History

History
45 lines (38 loc) · 1.18 KB

README.md

File metadata and controls

45 lines (38 loc) · 1.18 KB

full_text_indexers

Swish++

  
  find  -iname "*.htm*" | tee /tmp/html_files.txt
  cat /tmp/html_files.txt | index++ --index-file /tmp/swish++.index --verbosity 4 --pattern 'html:*.htm*' -e 'text:*.txt' -
  cat du_files.txt  | perl -pe 's{^.*?\s+}{}g' | grep '.htm' | index++ --index-file /tmp/swish++.index --verbosity 4 --pattern 'html:*.htm*' -e 'text:*.txt' --temp-dir /media/sarnobat/ebay/trash/ -
  cd web
  search++ --max-results 1000 -i /media/sarnobat/ebay/swish++.index mysearchterm
  cd web
  search++ --dump-index

Recoll

Problem: not as attractive output format as tracker

find  -iname "*.htm*" | tee /tmp/html_files.txt
cat /tmp/html_files.txt | RECOLL_TMPDIR=/media/sarnobat/cache_recoll recollindex -if -c /media/sarnobat/cache_recoll/recoll_index/
ls ~/.recoll

recollq -c /media/sarnobat/cache_recoll/recoll_index/  'atletico -myexcludedterm mime:text/html'

Tracker

Problem: can't index individual files, only directories it seems.

  sudo apt install -y tracker
  tracker3 index
  tracker3 index --add videos/
  tracker3 search atletico

Others