From theory to practice

bertibott · (This post was last modified: Feb-24-2017, 03:59 PM by bertibott.)

okay... so re-reading the numpy array documentation I get that it actually has a field for the data type which in this case is int16.. so 16bit signed integer.

I was a bit hung up on this:

Output:[[-1 -2]
 [ 1  1]
 [-4 -3]
 ..., 
 [ 4 -2]
 [-4  2]
 [ 4 -1]]
(44100, array([[-1, -2],
       [ 1,  1],
       [-4, -3],
       ..., 
       [ 4, -2],
       [-4,  2],
       [ 4, -1]], dtype=int16))

first one is printing out "audio" and the other is printing out "original" from the code of the original post... one is with a lot of commas... (the way I expected it to be after reading the manual) and the other isn't... just drops one in the middle where it doesn't really belong.

Okay... so now that I have an actual idea of how the data is represented I can think about altering the way I want to.

First I wanted to have it converted to mono... so generally you take the left and right samples and average them:

length = audio.shape[0]
mono = (audio[0:length:1, 0]+audio[0:length:1, 1])/2

Now there are two things I am wondering about:
The shape of the new array is this:

Output:
(210331,)

This means it's still a n by 2 array, right?

And secondly what is the best way to convert the datatype into float? So that I can work with a range of values from -1 to 1.

Possibly Related Threads…
Thread		Author	Replies	Views	Last Post
	.xls processing with Pandas - my struggle moving from theory to practical use	prolle	0	1,584	May-21-2020, 06:57 PM Last Post: prolle
	Help with Data Match Theory	randor	2	2,065	Dec-25-2019, 05:57 PM Last Post: randor

From theory to practice

User Panel Messages

Announcements