@robintux
Last active October 26, 2025 19:47
stop_words = {
'i', 'me', 'my', 'myself', 'we', 'our', 'ours', 'ourselves', 'you', 'your', 'yours',
'yourself', 'yourselves', 'he', 'him', 'his', 'himself', 'she', 'her', 'hers',
'herself', 'it', 'its', 'itself', 'they', 'them', 'their', 'theirs', 'themselves',
'what', 'which', 'who', 'whom', 'this', 'that', 'these', 'those', 'am', 'is', 'are',
'was', 'were', 'be', 'been', 'being', 'have', 'has', 'had', 'having', 'do', 'does',
'did', 'doing', 'a', 'an', 'the', 'and', 'but', 'if', 'or', 'because', 'as', 'until',
'while', 'of', 'at', 'by', 'for', 'with', 'through', 'during', 'before', 'after',
'above', 'below', 'up', 'down', 'in', 'out', 'on', 'off', 'over', 'under', 'again',
'further', 'then', 'once', 'here', 'there', 'when', 'where', 'why', 'how', 'all',
'any', 'both', 'each', 'few', 'more', 'most', 'other', 'some', 'such', 'no', 'nor',
'not', 'only', 'own', 'same', 'so', 'than', 'too', 'very', 's', 't', 'can', 'will',
'just', 'don', 'should', 'now'
}

ACADEMIC_TERMS = {
'data', 'analysis', 'method', 'model', 'theory', 'result', 'study', 'research',
'system', 'problem', 'solution', 'process', 'approach', 'technique', 'algorithm',
'function', 'structure', 'property', 'behavior', 'performance', 'evaluation',
'experiment', 'simulation', 'application', 'development', 'implementation',
'framework', 'mechanism', 'phenomenon', 'characteristic', 'parameter',
'variable', 'equation', 'formula', 'theorem', 'lemma', 'corollary', 'proof',
'hypothesis', 'conjecture', 'definition', 'proposition', 'axiom', 'principle'
}
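
A minimal sketch of how two sets like these might be used together: drop stop words from a text, then count how often the remaining tokens hit the academic vocabulary. The function names `content_tokens` and `academic_term_counts`, the regex tokenizer, and the small `demo_*` sets are assumptions made for illustration and are not part of the gist. In practice the full `stop_words` and `ACADEMIC_TERMS` sets would be passed in instead.

```python
import re
from collections import Counter


def content_tokens(text, stop_words):
    """Lowercase the text, split it into alphabetic runs, and drop stop words."""
    # Splitting on non-letters turns "don't" into "don" and "t",
    # which a stop-word set containing 'don', 's' and 't' then removes.
    tokens = re.findall(r"[a-z]+", text.lower())
    return [t for t in tokens if t not in stop_words]


def academic_term_counts(tokens, academic_terms):
    """Count only the tokens that appear in the academic vocabulary."""
    return Counter(t for t in tokens if t in academic_terms)


# Hypothetical excerpts of the two sets, kept short so the example stands alone.
demo_stop_words = {'the', 'of', 'a', 'is', 'on', 'and', 'this'}
demo_academic_terms = {'model', 'data', 'analysis', 'result'}

text = "The model is a result of the analysis, and this model depends on data."
tokens = content_tokens(text, demo_stop_words)
counts = academic_term_counts(tokens, demo_academic_terms)

print(tokens)
print(counts.most_common())
```

Both collections are sets, so each membership test is a constant-time lookup on average. Note that the tokenizer matches exact forms only: a plural such as "models" would not match 'model' without a stemming or lemmatization step.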
