sciwing.preprocessing¶
instance_preprocessing¶
-
class
sciwing.preprocessing.instance_preprocessing.
InstancePreprocessing
¶ Bases:
object
This class implements some common pre-processing that may be applied on instances which are List[str]. For example, you can remove stop words, convert the word into lower case and others. Most of the methods here accept an instance and return an instance
-
static
indicate_capitalization
(instance: List[str]) → List[str]¶ Indicates whether every word is all small, all caps or captialized
Parameters: instance (List[str]) – A list of tokens Returns: Strings indicating capitalization Return type: List[str]
-
static
lowercase
(instance: List[str]) → List[str]¶
-
remove_stop_words
(instance: List[str]) → List[str]¶ Remove stop words if they are present We will use stop-words package from pip https://github.com/Alir3z4/python-stop-words
Parameters: instance (List[str]) – The list of tokens Returns: The instance with stop words removed Return type: List[str]
-
static