StackOverflow2013

Note that there are some explanatory texts on larger screens.

plurals

PO
primarykey
Id
12964210
data
AcceptedAnswerId
0
AnswerCount
0
ClosedDate
CommentCount
1
CommunityOwnedDate
CreationDate
2012-10-18T21:42:07.900
FavoriteCount
0
LastActivityDate
2012-10-18T21:42:07.900
LastEditDate
LastEditorUserId
0
OwnerUserId
935868
ParentId
12961517
PostTypeId
2
Score
2
ViewCount
0
LastEditorDisplayName
text
Body
While separating the individual speakers is quite a difficult problem you can automatically split the audio where there are pauses. This would produce a series of files that would likely be easier to manage since speakers often alternate between pauses. This approach requires the open source Julius speech recognition decoder package. This is available in many Linux package repositories. I use the Ubuntu multiverse repository. Here is the site: <a href="http://julius.sourceforge.jp/en_index.php" rel="nofollow">http://julius.sourceforge.jp/en_index.php</a> <hr> Step 0: Install Julius <pre><code>sudo apt-get install julius </code></pre> Step 1: Segment Audio <pre><code>adintool -in file -out file -filename myRecording.wav -startid 0 -freq 44100 -lv 2048 -zc 30 -headmargin 600 -tailmargin 600 </code></pre> <ul> <li>-startid is the starting segment number that will be appended to the filename</li> <li>-freq is the sample rate of the source audio file</li> <li>-lv is the level of the audio above which voice detection will be active</li> <li>-zc is the zero crossings above which voice detection will be active</li> <li>-headmargin and -tailmargin is the amount of silence before and after each audio segment</li> </ul> Note that -lv and -zc will have to be adjusted for your particular audio recording's attributes while -headmargin and -tailmargin will have to be adjusted for your particular speaker's styles. But the values given above have worked well for my voice recordings in the past. Here is the documentation: <a href="http://julius.sourceforge.jp/juliusbook/en/adintool.html" rel="nofollow">http://julius.sourceforge.jp/juliusbook/en/adintool.html</a> <hr> In my experience preprocessing the audio using compression and normalization gives better results and requires less adjustment of the Julius arguments. These initial steps are recommended but not required. This approach requires the open source SoX audio toolkit package. This is also available in many Linux package repositories. I use the Ubuntu universe repository. Here is the site: <a href="http://sox.sourceforge.net" rel="nofollow">http://sox.sourceforge.net</a> <hr> Step -2: Install SoX <pre><code>sudo apt-get install sox </code></pre> Step -1: Preprocess Audio <pre><code>sox myOriginalRecording.wav myRecording.wav gain -b -n -8 compand 0.2,0.6 4:-48,-32,-24 0 -64 0.2 gain -b -n -2 </code></pre> <ul> <li>gain -b -n balances and normalizes the audio to a given level</li> <li>compand compresses (in this case) the audio based on the parameters</li> </ul> Note that compand may require some time to completely understand the parameters. But the values given above have worked well for my voice recordings in the past. Here is the documentation: <a href="http://sox.sourceforge.net/sox.html" rel="nofollow">http://sox.sourceforge.net/sox.html</a> <hr> While this will not give you identification of each speaker it will greatly simplify the task of doing it by ear, which may end up being the only option for a while. But I do hope you find practical solution if it is already available.
Tags
Title
singulars
PostAcceptedAnswerId
1. This table or related slice is empty.
PostParentId
1. POhow to separate an audio file based on different speakers
 singulars
 PostTypePostTypeId
 PTQuestion
PostTypePostTypeId
1. PTAnswer
UserLastEditorUserId
1. This table or related slice is empty.
UserOwnerUserId
1. USKelly Christoffersen
plurals
PostLinksPostIdRelatedPostId
1. This table or related slice is empty.
PostLinksRelatedPostIdPostId
1. This table or related slice is empty.
PostsAcceptedAnswerId
1. POhow to separate an audio file based on different speakers
 singulars
 PostTypePostTypeId
 PTQuestion
PostsParentIdCreationDate
1. This table or related slice is empty.
VotesPostIdCreationDate
1. VO
 singulars
 PostPostId
 PO
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTAcceptedByOriginator
2. VO
 singulars
 PostPostId
 PO
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTDownMod
3. VO
 singulars
 PostPostId
 PO
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
CommentsPostId
1. COThank you for your approach
 singulars
 PostPostId
 PO
 UserUserId
 USBo Liu

Querying!

Guidance

A row detail

Detail views are divided into sections. All the information in the data section comes from columns in the selected row. The other sections display data from other, related rows.

Related data can be related in a to-one or a to-many fashion. Captions of data related in a to-many fashion link to a list view showing a filtered view of the table.

Try moving around until you find a non-empty to-many entry and click on the label to get to one. You can move back to the root by clicking on the database name in the header.