Need Help with this peace of code.PLease Exlain Each line of the code.

AkashDubey · Mar-08-2018, 03:18 PM

#find sentences containing HTML tags
i=0;
for sent in final['Text'].values:
    if (len(re.findall('<.*?>', sent))):
        print(i)
        print(sent)
        break;
    i += 1;

AkashDubey · Mar-08-2018, 03:20 PM

import re
import string
from nltk.corpus import stopwords
from nltk.stem import PorterStemmer
from nltk.stem.wordnet import WordNetLemmatizer

stop = set(stopwords.words('english')) #set of stopwords
sno = nltk.stem.SnowballStemmer('english') #initialising the snowball stemmer

def cleanhtml(sentence): #function to clean the word of any html-tags
    cleanr = re.compile('<.*?>')
    cleantext = re.sub(cleanr, ' ', sentence)
    return cleantext
def cleanpunc(sentence): #function to clean the word of any punctuation or special characters
    cleaned = re.sub(r'[?|!|\'|"|#]',r'',sentence)
    cleaned = re.sub(r'[.|,|)|(|\|/]',r' ',cleaned)
    return  cleaned
print(stop)
print('************************************')
print(sno.stem('tasty'))

AkashDubey · Mar-08-2018, 03:22 PM

#Code for implementing step-by-step the checks mentioned in the pre-processing phase
# this code takes a while to run as it needs to run on 500k sentences.
i=0
str1=' '
final_string=[]
all_positive_words=[] # store words from +ve reviews here
all_negative_words=[] # store words from -ve reviews here.
s=''
for sent in final['Text'].values:
    filtered_sentence=[]
    #print(sent);
    sent=cleanhtml(sent) # remove HTMl tags
    for w in sent.split():
        for cleaned_words in cleanpunc(w).split():
            if((cleaned_words.isalpha()) & (len(cleaned_words)>2)):    
                if(cleaned_words.lower() not in stop):
                    s=(sno.stem(cleaned_words.lower())).encode('utf8')
                    filtered_sentence.append(s)
                    if (final['Score'].values)[i] == 'positive': 
                        all_positive_words.append(s) #list of all words used to describe positive reviews
                    if(final['Score'].values)[i] == 'negative':
                        all_negative_words.append(s) #list of all words used to describe negative reviews reviews
                else:
                    continue
            else:
                continue 
    #print(filtered_sentence)
    str1 = b" ".join(filtered_sentence) #final string of cleaned words
    #print("***********************************************************************")
    
    final_string.append(str1)
    i+=1

***snippsat*** · (This post was last modified: Mar-08-2018, 03:49 PM by snippsat.)

I have merge your 3 threads,it was a close call with delete Dodgy

We are not gone explain what each line dos.
You have to do the effort,it there are certain lines you wonder about ask about them.

AkashDubey · (This post was last modified: Mar-08-2018, 04:29 PM by AkashDubey.)

Then, Please give an idea of what all the above three code snippets do briefly explaining each code snippet in detail.

(Mar-08-2018, 03:49 PM)snippsat Wrote: I have merge your 3 threads,it was a close call with delete
We are not gone explain what each line dos.
You have to do the effort,it there are certain lines you wonder about ask about them.

Then, Please give an idea of what all the above three code snippets do briefly explaining each code snippet in detail.

***sparkz_alot*** · Mar-08-2018, 06:55 PM

The code is already very well documented as to what it does. As was pointed out, we are not going to explain the purpose of each line. You might want to start with finding a tutorial for Python beginners to learn the basics of the language.

Possibly Related Threads…
Thread		Author	Replies	Views	Last Post
	I am getting an IndentError on my python code in VS Code and i dont know why	jcardenas1980	11	4,079	Mar-22-2025, 09:49 AM Last Post: Pedroski55
	Merge Python code with Micro Python code?	adzy	2	957	Jul-03-2024, 11:41 AM Last Post: kkinder
	Printing the code line number	arbiel	6	1,632	Jun-30-2024, 08:01 AM Last Post: arbiel
	Unable to understand the meaning of the line of code.	jahuja73	0	1,028	Jan-23-2024, 05:09 AM Last Post: jahuja73
	My code works on Jupyter Lab/Notebook, but NOT on Visual Code Editor	jst	4	4,289	Nov-15-2023, 06:56 PM Last Post: jst
	Trying to loop through code to plot seaborn line plots across multiple subplots	eyavuz21	0	2,792	Dec-05-2022, 10:46 AM Last Post: eyavuz21
	python multiple try except block in my code -- can we shorten code	mg24	10	14,911	Nov-10-2022, 12:48 PM Last Post: DeaD_EyE
	faster code for my code	kucingkembar	19	6,452	Aug-09-2022, 09:48 AM Last Post: DPaul
	Python code to read second line from CSV files and create a master CSV file	sh1704	1	3,499	Feb-13-2022, 07:13 PM Last Post: menator01
	how long can a line of code be?	Skaperen	2	2,977	Jun-09-2021, 06:31 PM Last Post: Skaperen

Need Help with this peace of code.PLease Exlain Each line of the code.

User Panel Messages

Announcements