Soxを使用して多くのオーディオファイルのスペクトログラムを効率的に作成する方法

Question

大量のオーディオファイルがあり、Soxを使用して個々のファイルのスペクトログラムを作成したいと考えています。通常、単一のファイルの場合、これを行います：

sox audiofile.flac -n spectrogram

ただし、この方法を複数のファイルに拡張する方法がわかりません。理想的には、出力を希望します.png fileそれぞれのオーディオファイルにファイル名を関連付けます。例えば audiofile1.png ために audiofile1.flac、audiofile2.png ために audiofile2.flac 等々。

誰かがこれを行う方法を知っていますか？

Harry Williams · Answer

ジョセフに答えてくれてありがとう。多分それは彼の投稿の時にうまくいったが、私はsoxがコマンドを受け入れるためにspectrogramの直後に-oを追加しなければならなかった。

for file in *.flac;do outfile="${file%.*}.png" sox "$file" -n spectrogram -o "$outfile" done

私はそれらすべてをアーカイブのために独自のフォルダに入れます。

画像内のスペクトログラムの上にファイルのタイトルを追加し、少し幅を広げて詳細を表示することで、さらに先に進むこともできます。デフォルトの画像は私には少し小さいです。

for file in *.flac;do outfile="${file%.*}.png" title_in_pic="${file%.*}" sox "$file" -n spectrogram -t "$title_in_pic" -o "$outfile" -x 2000 done

Joseph R. · Answer

コマンドをループで囲むことができます。

for file in *.flac do outfile="${file%.*}.png" sox "$file" -n spectrogram "$outfile" done

ファイルの命名については、 sox（1）のマニュアルページを使用すると、コマンドラインで出力ファイルに明示的に名前を付けて、ループ内で使用できるようになります。

ループの最初の行では、Bashのパラメータ置換を使用して、ファイル名から.flac拡張子を取り除き、.png拡張子を追加します。

Mila Nautikus · Answer

これが...への私の「スペクトログラムを取得」ソリューションです。

aac、opusなどのより多くのコーデックを処理する
mp4、mkv、avi、m4aのような多くのコンテナを処理する
スペクトログラムの高さを24kHzに正規化する
1つのチャネルのみをプロット=モノ
音量を正規化する
出力ファイルの入力ファイル拡張子を保持する

#!/bin/bash # aspec.sh # get spectrograms of audio streams # # usage: aspec.sh a.mp3 b.m4a c.mp4 d.mkv .... # # dependencies: sox, ffmpeg # license: public domain, warranty: none # version: 2019-05-17 by milahu ff_args="" # ffmpeg arguments sx_args="" # sox arguments ff_args+=" -loglevel error" ff_astream=0 # only use first audio stream ff_args+=" -map 0:a:${ff_astream}?" ff_args+=" -ac 1" # use only one audio channel sx_args+=" channels 1" sx_args+=" gain -n -3" # normalize volume to -3dB # set sampling rate # only analyze frequencies below f_max = rate / 2 # also normalize spectrogram height to f_max #sx_args+=" rate 6k" # only show f < 3kHz "where the human auditory system is most sensitive" sx_args+=" rate 48k" # only show f < 24kHz # use wav as temporary format, if sox cant read file ff_args+=" -c:a pcm_s16le -f wav" sx_type="wav" # process files from "argv" for i in "$@" do echo "$i" o="$i.sg.png" # output file t=$(basename "$i") # title above spectrogram c="spectrogram by SoX, the Sound eXchange tool" # comment below spectrogram # try to read original format echo analyze sox "$i" -n \ $sx_args \ spectrogram -o "$o" -c "$c" -t "$t" \ 2>&1 | grep -v "no handler for detected file type" if (( ${PIPESTATUS[0]} != 0 )) then # sox failed. convert audio and retry echo convert # get duration of stream or container # spectrogram filter has no "ignore length" option # and without a "duration prediction" will only read 8 seconds d=$(ffprobe "$i" -v error -of compact=s=_ \ -select_streams "0:a:${ff_astream}?" \ -show_entries stream=duration:format=duration \ | sort | grep -v =N/A \ | tail -n 1 | cut -d= -f2) # 'tail -n 1' --> prefer stream duration # 'head -n 1' --> prefer container duration if [[ -z "$d" ]] then echo -e "skip. duration not found FIXME
" continue fi # bash "process substitution" magic sox \ --type "$sx_type" \ --ignore-length \ <( ffmpeg -i "$i" $ff_args - ) \ --null \ $sx_args \ spectrogram -d "$d" -o "$o" -c "$c" -t "$t" fi echo -e "done
$o
" done