Add details to README.md about how exactly this extension works #13
Comments
I implemented this over 9 years ago and don't recall the details, so I browsed the code to review the algorithm. A sentence's importance is calculated by assigning a score to each word in the sentence and summing those scores. A word's score is based on its frequency throughout the document: the more often a word appears, the higher its score. Because a long sentence accumulates a higher sum simply by containing more words, the score of long sentences is reduced to compensate. See lines 1067 to 1186 in 3bf1319.
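To make the description above concrete, here is a minimal sketch of frequency-based sentence scoring. It is not the extension's actual code: the function names `score_sentences` and `top_sentences`, the regex-based sentence splitting and tokenizing, and especially the `length_penalty` exponent are assumptions made for illustration. The comment above only says that long sentences have their score reduced, not how.

```python
import re
from collections import Counter


def score_sentences(text, length_penalty=0.5):
    """Return (sentence, score) pairs in document order.

    Assumption: the length adjustment divides the summed score by
    len(words) ** length_penalty. The real extension may use a
    different reduction for long sentences.
    """
    # Naive sentence split on whitespace that follows ., ! or ?
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]
    tokenized = [re.findall(r"[a-z0-9']+", s.lower()) for s in sentences]

    # A word's score is its frequency across the whole document.
    freq = Counter(word for words in tokenized for word in words)

    scored = []
    for sentence, words in zip(sentences, tokenized):
        if not words:
            continue
        raw = sum(freq[word] for word in words)  # sum of word scores
        scored.append((sentence, raw / len(words) ** length_penalty))
    return scored


def top_sentences(text, n=1, length_penalty=0.5):
    """Return the n highest-scoring sentences, best first."""
    ranked = sorted(
        score_sentences(text, length_penalty),
        key=lambda pair: pair[1],
        reverse=True,
    )
    return [sentence for sentence, _ in ranked[:n]]
```

With `length_penalty=0` the score is the plain sum of word frequencies, which favors long sentences; with `length_penalty=1` it becomes the average word frequency. Values in between trade off the two, which is one plausible way to read "the score of long sentences is reduced".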
At the moment, the information provided is not enough to understand what exactly is meant by "the important content", or by what criteria this content is identified in the text of web pages.