We have learned several string operations in our previous blogs. Proceeding further we are going to work on some very interesting and useful concepts of text preprocessing using NLTK in Python. Let us first understand the text processing thought process by observing the following text to work on. sample_text = '''A nuclear power plant is a thermal power station in which the heat source is a nuclear reactor. As is typical of thermal power stations, heat is usssded to generate steam that drives a steam turbine connected to aaaaa generator that produces electricity. As of 2018, the International Atomic Enertgy Agency reported ther were 450 nuclear power reactors in operation in 30 countries.''' If you would want to do text preprocessing using NLTK in Python, what all steps pop up in our head? How do you think a text is processed? Whether a whole document is processed at once? Or it is broken down into individual words? Do you think the words like “of”, “the”, “to” add any value in our text analysis? Do these words provide us with any information? What can be done about these words? Can you spot some incorrectly spelled words? Would you like to correct […]
The post Text preprocessing using NLTK in Python appeared first on Ivy Professional School | Official Blog.