About soundview.cgi


soundview.cgi is a CGI script that displays a segment from a soundfile as a waveform and a spectrogram along with some label file data (such as phoneme labels). It also allows stepping through the soundfile and playback of the displayed region. It all runs on the server side, so it can be used with almost any graphical web browser. It was developed during the STP project, when we were using ESPS/xwaves+ for hand-labeling a chunk of the Switchboard corpus, and we wanted to be able to display some examples on the web. Thus, the display portion emulates, to some extent, the xwaves labeler display.

soundview.cgi takes its arguments as part of the URL ("GET" format), and thus it can be easily linked into other pages to generate specific displays on-the-fly. Each page involves recalculating the Short-time Fourier transform of the specified chunk of sound, so it can add quite a load to the web server.

The actual sound file being accessed (expressed as a path local to the webserver machine) and the time extents of the view can be edited in the entries below the display and resubmitted to generate a new view.

What it looks like

Below is a screenshot of one of the examples from the STP site:


Updated: $Date: 1999/05/20 22:17:00 $
DAn Ellis <dpwe@icsi.berkeley.edu>
International Computer Science Institute, Berkeley CA