Python Forum
Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
String extraction
#2
>>> my_string = 'grant-perice, Mr. Owen Harris'
>>> title, name = my_string.split(',')
>>> title
'grant-perice'
>>> name.strip()
'Mr. Owen Harris'
>>> name.split('.')[-1].strip()
'Owen Harris'
>>> title, name = my_string.split('.')
>>> name.strip()
'Owen Harris'
>>> title
'grant-perice, Mr'
of course, you can also use RegEx, check re module

or you can install third party package nameparser
>>> import nameparser
>>> name = HumanName('grant-perice, Mr. Owen Harris')
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
NameError: name 'HumanName' is not defined
>>> name = nameparser.HumanName('grant-perice, Mr. Owen Harris')
>>> name
<HumanName : [
	title: 'Mr.' 
	first: 'Owen' 
	middle: 'Harris' 
	last: 'grant-perice' 
	suffix: ''
	nickname: ''
]>
>>> name = nameparser.HumanName('Mr. Owen Harris')
>>> name
<HumanName : [
	title: 'Mr.' 
	first: 'Owen' 
	middle: '' 
	last: 'Harris' 
	suffix: ''
	nickname: ''
]>
If you can't explain it to a six year old, you don't understand it yourself, Albert Einstein
How to Ask Questions The Smart Way: link and another link
Create MCV example
Debug small programs

Reply


Messages In This Thread
String extraction - by Scott - Jul-21-2018, 07:30 AM
RE: String extraction - by buran - Jul-21-2018, 07:43 AM
RE: String extraction - by Scott - Jul-21-2018, 10:49 AM
RE: String extraction - by buran - Jul-21-2018, 09:01 PM

Possibly Related Threads…
Thread Author Replies Views Last Post
  eml file data extraction ajetrumpet 2 2,708 Jul-04-2020, 04:34 AM
Last Post: ajetrumpet
  Table extraction from scanned PDF RupamKundu 1 3,745 Aug-03-2019, 02:59 AM
Last Post: Larz60+
  Json value extraction aaronwarwick 1 2,157 Jun-24-2019, 07:23 PM
Last Post: micseydel
  Substring extraction nevendary 6 4,026 Apr-24-2019, 05:41 AM
Last Post: nevendary
  Automating a Data Extraction Process Harrison 12 8,741 Mar-31-2017, 10:44 AM
Last Post: Harrison

Forum Jump:

User Panel Messages

Announcements
Announcement #1 8/1/2020
Announcement #2 8/2/2020
Announcement #3 8/6/2020