Python Forum
Delete Lines that Contain Words - Loop through files in a folder - Write to new files
Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
Delete Lines that Contain Words - Loop through files in a folder - Write to new files
#1
Hi friends,

I am trying to loop through files and from each file delete the lines that contain the search keywords




import os

def remove_line_from_file(filename, line_to_remove, dirpath=''):
  
    filename = os.path.join(dirpath, filename)
    temp_path = os.path.join(dirpath, 'temp.txt')


    
    with open(filename, 'r') as f_read, open(temp_path, 'w') as temp:


        search_keywords=['Car','Train']                # Delete all sentences that contain these words
        
        for line in f_read:
            if (any(map(lambda word: word in line, search_keywords))):
                    
                if line.strip() == word:            #f line.strip() == line_to_remove:
                    continue
                temp.write(line)

        os.remove(filename)
        os.rename(temp_path, filename)
        directory = 'C:/Users/Home/Desktop/test/'
        dirpath, _, files = next(os.walk(directory))

        for f in files:
            remove_line_from_file(f, line, dirpath)
I dont get an error as such - but I cant work out why it doesnt work.

It may not be syntactically correct as i borrowed code from here and there.

Please may a kind person have a look at this

thank you for your time and help



:)


Python newbie trying to learn the ropes
Reply
#2
#!/usr/bin/python3
import os

def remove_line_from_file(filename, dirpath):
    filename  = os.path.join(dirpath, filename)
    temp_path = os.path.join(dirpath, filename + '.tmp')
    search_keywords = ['Car','Train']

    with open(filename, 'r') as f_read, open(temp_path, 'w') as temp:
        for line in f_read:
            found = False
            for word in search_keywords:
                if word in line:
                    found = True
            if not found:
                temp.write(line)

    #os.rename(temp_path, filename)

#main
directory = './'
for root, dirs, files in os.walk(directory):
    for f in files:
        remove_line_from_file(f, directory)
#done
Reply
#3
Thank you my dear friend heiner55.

I am so happy because I have been trying for days to work out the correct syntax for this.
I guess i didnt need the lambda thing
if (any(map(lambda word: word in line, search_keywords))):

Simplicity is the best

My files have so much garbage and it was very hard for me to find these line and manually delete them one after the other Wall

Thank you so much again for lending your programming eyes and time to help me.

Python hero!

I really apreciate it so much

Hope you have a great week! Big Grin

:)



:)


Python newbie trying to learn the ropes
Reply
#4
Glad to hear.
Reply


Possibly Related Threads…
Thread Author Replies Views Last Post
  Is it possible to extract 1 or 2 bits of data from MS project files? cubangt 8 940 Feb-16-2024, 12:02 AM
Last Post: deanhystad
  File loop curiously skipping files - FIXED mbk34 10 683 Feb-10-2024, 07:08 AM
Last Post: buran
  Copy Paste excel files based on the first letters of the file name Viento 2 347 Feb-07-2024, 12:24 PM
Last Post: Viento
  Class test : good way to split methods into several files paul18fr 4 403 Jan-30-2024, 11:46 AM
Last Post: Pedroski55
  Compare folder A and subfolder B and display files that are in folder A but not in su Melcu54 3 466 Jan-05-2024, 05:16 PM
Last Post: Pedroski55
  uploading files from a ubuntu local directory to Minio storage container dchilambo 0 398 Dec-22-2023, 07:17 AM
Last Post: dchilambo
  Upload Files to Azure Storage Container phillyfa 6 578 Dec-22-2023, 06:11 AM
Last Post: Pedroski55
  merge all xlsb files into csv mg24 0 303 Nov-13-2023, 08:25 AM
Last Post: mg24
  Newbie question about switching between files - Python/Pycharm Busby222 3 543 Oct-15-2023, 03:16 PM
Last Post: deanhystad
  Move Files based on partial Match mohamedsalih12 2 744 Sep-20-2023, 07:38 PM
Last Post: snippsat

Forum Jump:

User Panel Messages

Announcements
Announcement #1 8/1/2020
Announcement #2 8/2/2020
Announcement #3 8/6/2020