Python Forum
Removing timestamps from transcriptions
Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
Removing timestamps from transcriptions
#1
There are various files of transcriptions and they have timestamps in them. They look like this ..

Quote:So from 12:23 that very moment you actually 12:25 actually continued the form of a choice, 12:28 that you kept going to the point of 12:30 no return, where you actually became who 12:33 you are now. 12:35 So you actually took part in 12:38 who you are now.

and we need them to look like this ..

Quote:So from that very moment you actually actually continued the form of a choice, that you kept going to the point of no return, where you actually became who you are now. So you actually took part in who you are now.

Is it simply a matter of a search and replace ? Like search for either numeric or a colon, and replace with a null ? Wanted to use python to parse through a number of files; there are about 20 of these .txt files and some are 130K. So doing it manually is out of the question.

Possibly search for a space and numeric to indicate the start of where it needs replacing, and more often than not we have .

Quote:access 11:30 that?

to become .

Quote:access that?

so there is usually that preceeding space to be replaced with null also.
Reply


Messages In This Thread
Removing timestamps from transcriptions - by jehoshua - Dec-05-2018, 03:17 AM

Possibly Related Threads…
Thread Author Replies Views Last Post
  How to find tags using specific text (timestamps) in a url? q988988 1 1,387 Mar-08-2022, 08:09 AM
Last Post: buran
  Speech Recognition with timestamps DeanAseraf1 3 6,625 Jun-27-2021, 06:58 PM
Last Post: gh_ad
Bug Help on Flagging Timestamps Daring_T 2 1,880 Oct-28-2020, 08:11 PM
Last Post: Daring_T
  How to compare timestamps in python asad 2 9,119 Oct-24-2018, 03:56 AM
Last Post: asad

Forum Jump:

User Panel Messages

Announcements
Announcement #1 8/1/2020
Announcement #2 8/2/2020
Announcement #3 8/6/2020