Aug-23-2020, 04:03 PM
I couldn't really think of a good title for this, so, sorry!
Say I have some data like so:
The important data is held between groups like
In the above data I have tried to include all the edge cases:
- The numbers of the groups are not in any specific order. Generally they are ascending but some will be out of order
- The groups numbers are between 1 and 2 characters
- Some groups will be empty like
- One groups has a time which has a colon in it:
In the data above, all actual data between the groups is hard coded. In the PHP file this comes from, most of that data is from many variables which explain what that data is. However some of the data is just constant values like
My current approach has a class full of values like this:
The code also relies on the class having every single value in the exact order of the data. If anything was changed in the data it would break. It also involves me having these "constant" values which are unknown and so it makes the code messy because I have to create variables like
Could anyone help me out on this?
Say I have some data like so:
:1:123456:2:name:42:3:30:4::5:somerandomdata:9:8
The important data is held between groups like
:1:
. Each of those groups has a different number from the next. I need to split the string between all of those groups so I can access that data. But due to some issues with the data I'll explain, it's not as easy as just splitting.In the above data I have tried to include all the edge cases:
- The numbers of the groups are not in any specific order. Generally they are ascending but some will be out of order
- The groups numbers are between 1 and 2 characters
- Some groups will be empty like
:4::5:
- One groups has a time which has a colon in it:
:42:3:30:4:
(the time being 3:30). This will mess up most already existing split functions.In the data above, all actual data between the groups is hard coded. In the PHP file this comes from, most of that data is from many variables which explain what that data is. However some of the data is just constant values like
1
and I don't know what they do. In the data above I made :9:8
the "unknown constant".My current approach has a class full of values like this:
class DataValues: _ID = 1 NAME = 2 TIME = 42 SOMETHING = 4 DATA = 5 CONSTANT = 9Each variable is in the order like in the data. To split I do this:
s = [] _vars = [getattr(DataValues, v) for v in vars(DataValues) if not v.startswith("__")] #loop through the variables in the class for i in range(len(_vars) - 1): split = re.search("(:{}:)(.*)(:{}:)".format(_vars[i], _vars[i+1]), self.data) split = "" if split is None else split.group(2) s.append(split)This works for the most part. It has 1 issue where the regex just doesn't split the string like so
:2:Sidestep:3:VXBkYXRlOiBhZGRlZCB
(It should have split between 2 and 3). The code also relies on the class having every single value in the exact order of the data. If anything was changed in the data it would break. It also involves me having these "constant" values which are unknown and so it makes the code messy because I have to create variables like
UKNOWN_1 = 8
.Could anyone help me out on this?