fofr / audio-to-waveform
Create a waveform video from audio
Prediction
fofr/audio-to-waveform:bb8e292b3c09ee44a941b24c0f9763cce27e773a2642f9618f1db29ed858c3a3Input
- audio
- Video Player is loading.Current Time 00:00:000/Duration 00:00:000Loaded: 0%Stream Type LIVERemaining Time -00:00:0001x
- Chapters
- descriptions off, selected
- captions settings, opens captions settings dialog
- captions off, selected
This is a modal window.
Beginning of dialog window. Escape will cancel and close the window.
End of dialog window.
- bg_color
- #000000
- fg_alpha
- 0.75
- bar_count
- 100
- bar_width
- 0.4
- bars_color
- #ffffff
{ "audio": "https://replicate.delivery/pbxt/IzkaYwR3MwRvUtjDSfz5jreFa4fcvsZBDnDBbWgr4CeKWsss/20230613T092714835Z_20s_hardstyle%2C_syncopated_clav.wav", "bg_color": "#000000", "fg_alpha": 0.75, "bar_count": 100, "bar_width": 0.4, "bars_color": "#ffffff" }
Install Replicate’s Node.js client library:npm install replicate
Import and set up the client:import Replicate from "replicate"; import fs from "node:fs"; const replicate = new Replicate({ auth: process.env.REPLICATE_API_TOKEN, });
Run fofr/audio-to-waveform using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
const output = await replicate.run( "fofr/audio-to-waveform:bb8e292b3c09ee44a941b24c0f9763cce27e773a2642f9618f1db29ed858c3a3", { input: { audio: "https://replicate.delivery/pbxt/IzkaYwR3MwRvUtjDSfz5jreFa4fcvsZBDnDBbWgr4CeKWsss/20230613T092714835Z_20s_hardstyle%2C_syncopated_clav.wav", bg_color: "#000000", fg_alpha: 0.75, bar_count: 100, bar_width: 0.4, bars_color: "#ffffff" } } ); // To access the file URL: console.log(output.url()); //=> "http://example.com" // To write the file to disk: fs.writeFile("my-image.png", output);
To learn more, take a look at the guide on getting started with Node.js.
Install Replicate’s Python client library:pip install replicate
Import the client:import replicate
Run fofr/audio-to-waveform using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
output = replicate.run( "fofr/audio-to-waveform:bb8e292b3c09ee44a941b24c0f9763cce27e773a2642f9618f1db29ed858c3a3", input={ "audio": "https://replicate.delivery/pbxt/IzkaYwR3MwRvUtjDSfz5jreFa4fcvsZBDnDBbWgr4CeKWsss/20230613T092714835Z_20s_hardstyle%2C_syncopated_clav.wav", "bg_color": "#000000", "fg_alpha": 0.75, "bar_count": 100, "bar_width": 0.4, "bars_color": "#ffffff" } ) # To access the file URL: print(output.url()) #=> "http://example.com" # To write the file to disk: with open("my-image.png", "wb") as file: file.write(output.read())
To learn more, take a look at the guide on getting started with Python.
Run fofr/audio-to-waveform using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
curl -s -X POST \ -H "Authorization: Bearer $REPLICATE_API_TOKEN" \ -H "Content-Type: application/json" \ -H "Prefer: wait" \ -d $'{ "version": "fofr/audio-to-waveform:bb8e292b3c09ee44a941b24c0f9763cce27e773a2642f9618f1db29ed858c3a3", "input": { "audio": "https://replicate.delivery/pbxt/IzkaYwR3MwRvUtjDSfz5jreFa4fcvsZBDnDBbWgr4CeKWsss/20230613T092714835Z_20s_hardstyle%2C_syncopated_clav.wav", "bg_color": "#000000", "fg_alpha": 0.75, "bar_count": 100, "bar_width": 0.4, "bars_color": "#ffffff" } }' \ https://api.replicate.com/v1/predictions
To learn more, take a look at Replicate’s HTTP API reference docs.
Output
{ "completed_at": "2023-06-13T17:19:15.421050Z", "created_at": "2023-06-13T17:19:11.973469Z", "data_removed": false, "error": null, "id": "aab3qizbpomh2z4wxrnvjxr2u4", "input": { "audio": "https://replicate.delivery/pbxt/IzkaYwR3MwRvUtjDSfz5jreFa4fcvsZBDnDBbWgr4CeKWsss/20230613T092714835Z_20s_hardstyle%2C_syncopated_clav.wav", "bg_color": "#000000", "fg_alpha": 0.75, "bar_count": 100, "bar_width": 0.4, "bars_color": "#ffffff" }, "logs": "/usr/local/lib/python3.9/site-packages/matplotlib/axes/_axes.py:2229: RuntimeWarning: overflow encountered in scalar add\ndx = [convert(x0 + ddx) - x for ddx in dx]\n/usr/local/lib/python3.9/site-packages/matplotlib/axes/_axes.py:2229: RuntimeWarning: overflow encountered in scalar subtract\ndx = [convert(x0 + ddx) - x for ddx in dx]\n/usr/local/lib/python3.9/site-packages/matplotlib/patches.py:739: RuntimeWarning: overflow encountered in scalar add\ny1 = self.convert_yunits(self._y0 + self._height)\nffmpeg version 4.3.6-0+deb11u1 Copyright (c) 2000-2023 the FFmpeg developers\nbuilt with gcc 10 (Debian 10.2.1-6)\nconfiguration: --prefix=/usr --extra-version=0+deb11u1 --toolchain=hardened --libdir=/usr/lib/x86_64-linux-gnu --incdir=/usr/include/x86_64-linux-gnu --arch=amd64 --enable-gpl --disable-stripping --enable-avresample --disable-filter=resample --enable-gnutls --enable-ladspa --enable-libaom --enable-libass --enable-libbluray --enable-libbs2b --enable-libcaca --enable-libcdio --enable-libcodec2 --enable-libdav1d --enable-libflite --enable-libfontconfig --enable-libfreetype --enable-libfribidi --enable-libgme --enable-libgsm --enable-libjack --enable-libmp3lame --enable-libmysofa --enable-libopenjpeg --enable-libopenmpt --enable-libopus --enable-libpulse --enable-librabbitmq --enable-librsvg --enable-librubberband --enable-libshine --enable-libsnappy --enable-libsoxr --enable-libspeex --enable-libsrt --enable-libssh --enable-libtheora --enable-libtwolame --enable-libvidstab --enable-libvorbis --enable-libvpx --enable-libwavpack --enable-libwebp --enable-libx265 --enable-libxml2 --enable-libxvid --enable-libzmq --enable-libzvbi --enable-lv2 --enable-omx --enable-openal --enable-opencl --enable-opengl --enable-sdl2 --enable-pocketsphinx --enable-libmfx --enable-libdc1394 --enable-libdrm --enable-libiec61883 --enable-chromaprint --enable-frei0r --enable-libx264 --enable-shared\nlibavutil 56. 51.100 / 56. 51.100\nlibavcodec 58. 91.100 / 58. 91.100\nlibavformat 58. 45.100 / 58. 45.100\nlibavdevice 58. 10.100 / 58. 10.100\nlibavfilter 7. 85.100 / 7. 85.100\nlibavresample 4. 0. 0 / 4. 0. 0\nlibswscale 5. 7.100 / 5. 7.100\nlibswresample 3. 7.100 / 3. 7.100\nlibpostproc 55. 7.100 / 55. 7.100\nInput #0, png_pipe, from '/tmp/tmp89myshba.png':\nDuration: N/A, bitrate: N/A\nStream #0:0: Video: png, rgba(pc), 1000x200, 25 fps, 25 tbr, 25 tbn, 25 tbc\nGuessed Channel Layout for Input Stream #1.0 : mono\nInput #1, wav, from '/tmp/tmpnjubhz2v20230613T092714835Z_20s_hardstyle,_syncopated_clav.wav':\nDuration: 00:00:08.00, bitrate: 512 kb/s\nStream #1:0: Audio: pcm_s16le ([1][0][0][0] / 0x0001), 32000 Hz, mono, s16, 512 kb/s\nStream mapping:\nStream #0:0 -> #0:0 (png (native) -> h264 (libx264))\nStream #1:0 -> #0:1 (pcm_s16le (native) -> aac (native))\nPress [q] to stop, [?] for help\n[libx264 @ 0x5577f5607780] using cpu capabilities: MMX2 SSE2Fast SSSE3 SSE4.2 AVX FMA3 BMI2 AVX2\n[libx264 @ 0x5577f5607780] profile High, level 3.0, 4:2:0, 8-bit\n[libx264 @ 0x5577f5607780] 264 - core 160 r3011 cde9a93 - H.264/MPEG-4 AVC codec - Copyleft 2003-2020 - http://www.videolan.org/x264.html - options: cabac=1 ref=3 deblock=1:0:0 analyse=0x3:0x113 me=hex subme=7 psy=1 psy_rd=1.00:0.00 mixed_ref=1 me_range=16 chroma_me=1 trellis=1 8x8dct=1 cqm=0 deadzone=21,11 fast_pskip=1 chroma_qp_offset=-2 threads=6 lookahead_threads=1 sliced_threads=0 nr=0 decimate=1 interlaced=0 bluray_compat=0 constrained_intra=0 bframes=3 b_pyramid=2 b_adapt=1 b_bias=0 direct=1 weightb=1 open_gop=0 weightp=2 keyint=250 keyint_min=25 scenecut=40 intra_refresh=0 rc_lookahead=40 rc=crf mbtree=1 crf=23.0 qcomp=0.60 qpmin=0 qpmax=69 qpstep=4 ip_ratio=1.40 aq=1:1.00\nOutput #0, mp4, to '/tmp/tmpcnlbtd13.mp4':\nMetadata:\nencoder : Lavf58.45.100\nStream #0:0: Video: h264 (libx264) (avc1 / 0x31637661), yuv420p(progressive), 1000x200, q=-1--1, 25 fps, 12800 tbn, 25 tbc\nMetadata:\nencoder : Lavc58.91.100 libx264\nSide data:\ncpb: bitrate max/min/avg: 0/0/0 buffer size: 0 vbv_delay: N/A\nStream #0:1: Audio: aac (LC) (mp4a / 0x6134706D), 32000 Hz, mono, fltp, 69 kb/s\nMetadata:\nencoder : Lavc58.91.100 aac\nframe= 181 fps=0.0 q=28.0 size= 0kB time=00:00:05.12 bitrate= 0.1kbits/s speed=10.2x\nframe= 200 fps=0.0 q=-1.0 Lsize= 103kB time=00:00:08.00 bitrate= 105.3kbits/s speed= 13x\nvideo:28kB audio:68kB subtitle:0kB other streams:0kB global headers:0kB muxing overhead: 6.943090%\n[libx264 @ 0x5577f5607780] frame I:1 Avg QP:24.86 size: 4322\n[libx264 @ 0x5577f5607780] frame P:156 Avg QP:25.90 size: 137\n[libx264 @ 0x5577f5607780] frame B:43 Avg QP:20.31 size: 58\n[libx264 @ 0x5577f5607780] consecutive B-frames: 60.5% 30.0% 7.5% 2.0%\n[libx264 @ 0x5577f5607780] mb I I16..4: 52.4% 17.8% 29.8%\n[libx264 @ 0x5577f5607780] mb P I16..4: 1.4% 0.1% 0.3% P16..4: 1.5% 0.3% 0.0% 0.0% 0.0% skip:96.4%\n[libx264 @ 0x5577f5607780] mb B I16..4: 0.5% 0.0% 0.0% B16..8: 2.4% 0.2% 0.0% direct: 0.0% skip:96.7% L0:48.2% L1:51.1% BI: 0.7%\n[libx264 @ 0x5577f5607780] 8x8 transform intra:9.9% inter:29.4%\n[libx264 @ 0x5577f5607780] coded y,uvDC,uvAC intra: 7.0% 0.0% 0.0% inter: 0.1% 0.0% 0.0%\n[libx264 @ 0x5577f5607780] i16 v,h,dc,p: 96% 1% 3% 0%\n[libx264 @ 0x5577f5607780] i8 v,h,dc,ddl,ddr,vr,hd,vl,hu: 36% 7% 57% 0% 0% 0% 0% 0% 0%\n[libx264 @ 0x5577f5607780] i4 v,h,dc,ddl,ddr,vr,hd,vl,hu: 77% 5% 13% 0% 1% 1% 1% 1% 0%\n[libx264 @ 0x5577f5607780] i8c dc,h,v,p: 100% 0% 0% 0%\n[libx264 @ 0x5577f5607780] Weighted P-Frames: Y:0.0% UV:0.0%\n[libx264 @ 0x5577f5607780] ref P L0: 89.8% 1.0% 6.2% 3.0%\n[libx264 @ 0x5577f5607780] ref B L0: 86.6% 13.1% 0.3%\n[libx264 @ 0x5577f5607780] ref B L1: 98.6% 1.4%\n[libx264 @ 0x5577f5607780] kb/s:28.18\n[aac @ 0x5577f5608d40] Qavg: 127.395", "metrics": { "predict_time": 1.54515, "total_time": 3.447581 }, "output": "https://replicate.delivery/pbxt/QolvuYWaR4afViluel8wSNXZKrqcrlFxbdyP4DfZzUxm4KLiA/tmpcnlbtd13.mp4", "started_at": "2023-06-13T17:19:13.875900Z", "status": "succeeded", "urls": { "get": "https://api.replicate.com/v1/predictions/aab3qizbpomh2z4wxrnvjxr2u4", "cancel": "https://api.replicate.com/v1/predictions/aab3qizbpomh2z4wxrnvjxr2u4/cancel" }, "version": "bb8e292b3c09ee44a941b24c0f9763cce27e773a2642f9618f1db29ed858c3a3" }
Generated in/usr/local/lib/python3.9/site-packages/matplotlib/axes/_axes.py:2229: RuntimeWarning: overflow encountered in scalar add dx = [convert(x0 + ddx) - x for ddx in dx] /usr/local/lib/python3.9/site-packages/matplotlib/axes/_axes.py:2229: RuntimeWarning: overflow encountered in scalar subtract dx = [convert(x0 + ddx) - x for ddx in dx] /usr/local/lib/python3.9/site-packages/matplotlib/patches.py:739: RuntimeWarning: overflow encountered in scalar add y1 = self.convert_yunits(self._y0 + self._height) ffmpeg version 4.3.6-0+deb11u1 Copyright (c) 2000-2023 the FFmpeg developers built with gcc 10 (Debian 10.2.1-6) configuration: --prefix=/usr --extra-version=0+deb11u1 --toolchain=hardened --libdir=/usr/lib/x86_64-linux-gnu --incdir=/usr/include/x86_64-linux-gnu --arch=amd64 --enable-gpl --disable-stripping --enable-avresample --disable-filter=resample --enable-gnutls --enable-ladspa --enable-libaom --enable-libass --enable-libbluray --enable-libbs2b --enable-libcaca --enable-libcdio --enable-libcodec2 --enable-libdav1d --enable-libflite --enable-libfontconfig --enable-libfreetype --enable-libfribidi --enable-libgme --enable-libgsm --enable-libjack --enable-libmp3lame --enable-libmysofa --enable-libopenjpeg --enable-libopenmpt --enable-libopus --enable-libpulse --enable-librabbitmq --enable-librsvg --enable-librubberband --enable-libshine --enable-libsnappy --enable-libsoxr --enable-libspeex --enable-libsrt --enable-libssh --enable-libtheora --enable-libtwolame --enable-libvidstab --enable-libvorbis --enable-libvpx --enable-libwavpack --enable-libwebp --enable-libx265 --enable-libxml2 --enable-libxvid --enable-libzmq --enable-libzvbi --enable-lv2 --enable-omx --enable-openal --enable-opencl --enable-opengl --enable-sdl2 --enable-pocketsphinx --enable-libmfx --enable-libdc1394 --enable-libdrm --enable-libiec61883 --enable-chromaprint --enable-frei0r --enable-libx264 --enable-shared libavutil 56. 51.100 / 56. 51.100 libavcodec 58. 91.100 / 58. 91.100 libavformat 58. 45.100 / 58. 45.100 libavdevice 58. 10.100 / 58. 10.100 libavfilter 7. 85.100 / 7. 85.100 libavresample 4. 0. 0 / 4. 0. 0 libswscale 5. 7.100 / 5. 7.100 libswresample 3. 7.100 / 3. 7.100 libpostproc 55. 7.100 / 55. 7.100 Input #0, png_pipe, from '/tmp/tmp89myshba.png': Duration: N/A, bitrate: N/A Stream #0:0: Video: png, rgba(pc), 1000x200, 25 fps, 25 tbr, 25 tbn, 25 tbc Guessed Channel Layout for Input Stream #1.0 : mono Input #1, wav, from '/tmp/tmpnjubhz2v20230613T092714835Z_20s_hardstyle,_syncopated_clav.wav': Duration: 00:00:08.00, bitrate: 512 kb/s Stream #1:0: Audio: pcm_s16le ([1][0][0][0] / 0x0001), 32000 Hz, mono, s16, 512 kb/s Stream mapping: Stream #0:0 -> #0:0 (png (native) -> h264 (libx264)) Stream #1:0 -> #0:1 (pcm_s16le (native) -> aac (native)) Press [q] to stop, [?] for help [libx264 @ 0x5577f5607780] using cpu capabilities: MMX2 SSE2Fast SSSE3 SSE4.2 AVX FMA3 BMI2 AVX2 [libx264 @ 0x5577f5607780] profile High, level 3.0, 4:2:0, 8-bit [libx264 @ 0x5577f5607780] 264 - core 160 r3011 cde9a93 - H.264/MPEG-4 AVC codec - Copyleft 2003-2020 - http://www.videolan.org/x264.html - options: cabac=1 ref=3 deblock=1:0:0 analyse=0x3:0x113 me=hex subme=7 psy=1 psy_rd=1.00:0.00 mixed_ref=1 me_range=16 chroma_me=1 trellis=1 8x8dct=1 cqm=0 deadzone=21,11 fast_pskip=1 chroma_qp_offset=-2 threads=6 lookahead_threads=1 sliced_threads=0 nr=0 decimate=1 interlaced=0 bluray_compat=0 constrained_intra=0 bframes=3 b_pyramid=2 b_adapt=1 b_bias=0 direct=1 weightb=1 open_gop=0 weightp=2 keyint=250 keyint_min=25 scenecut=40 intra_refresh=0 rc_lookahead=40 rc=crf mbtree=1 crf=23.0 qcomp=0.60 qpmin=0 qpmax=69 qpstep=4 ip_ratio=1.40 aq=1:1.00 Output #0, mp4, to '/tmp/tmpcnlbtd13.mp4': Metadata: encoder : Lavf58.45.100 Stream #0:0: Video: h264 (libx264) (avc1 / 0x31637661), yuv420p(progressive), 1000x200, q=-1--1, 25 fps, 12800 tbn, 25 tbc Metadata: encoder : Lavc58.91.100 libx264 Side data: cpb: bitrate max/min/avg: 0/0/0 buffer size: 0 vbv_delay: N/A Stream #0:1: Audio: aac (LC) (mp4a / 0x6134706D), 32000 Hz, mono, fltp, 69 kb/s Metadata: encoder : Lavc58.91.100 aac frame= 181 fps=0.0 q=28.0 size= 0kB time=00:00:05.12 bitrate= 0.1kbits/s speed=10.2x frame= 200 fps=0.0 q=-1.0 Lsize= 103kB time=00:00:08.00 bitrate= 105.3kbits/s speed= 13x video:28kB audio:68kB subtitle:0kB other streams:0kB global headers:0kB muxing overhead: 6.943090% [libx264 @ 0x5577f5607780] frame I:1 Avg QP:24.86 size: 4322 [libx264 @ 0x5577f5607780] frame P:156 Avg QP:25.90 size: 137 [libx264 @ 0x5577f5607780] frame B:43 Avg QP:20.31 size: 58 [libx264 @ 0x5577f5607780] consecutive B-frames: 60.5% 30.0% 7.5% 2.0% [libx264 @ 0x5577f5607780] mb I I16..4: 52.4% 17.8% 29.8% [libx264 @ 0x5577f5607780] mb P I16..4: 1.4% 0.1% 0.3% P16..4: 1.5% 0.3% 0.0% 0.0% 0.0% skip:96.4% [libx264 @ 0x5577f5607780] mb B I16..4: 0.5% 0.0% 0.0% B16..8: 2.4% 0.2% 0.0% direct: 0.0% skip:96.7% L0:48.2% L1:51.1% BI: 0.7% [libx264 @ 0x5577f5607780] 8x8 transform intra:9.9% inter:29.4% [libx264 @ 0x5577f5607780] coded y,uvDC,uvAC intra: 7.0% 0.0% 0.0% inter: 0.1% 0.0% 0.0% [libx264 @ 0x5577f5607780] i16 v,h,dc,p: 96% 1% 3% 0% [libx264 @ 0x5577f5607780] i8 v,h,dc,ddl,ddr,vr,hd,vl,hu: 36% 7% 57% 0% 0% 0% 0% 0% 0% [libx264 @ 0x5577f5607780] i4 v,h,dc,ddl,ddr,vr,hd,vl,hu: 77% 5% 13% 0% 1% 1% 1% 1% 0% [libx264 @ 0x5577f5607780] i8c dc,h,v,p: 100% 0% 0% 0% [libx264 @ 0x5577f5607780] Weighted P-Frames: Y:0.0% UV:0.0% [libx264 @ 0x5577f5607780] ref P L0: 89.8% 1.0% 6.2% 3.0% [libx264 @ 0x5577f5607780] ref B L0: 86.6% 13.1% 0.3% [libx264 @ 0x5577f5607780] ref B L1: 98.6% 1.4% [libx264 @ 0x5577f5607780] kb/s:28.18 [aac @ 0x5577f5608d40] Qavg: 127.395
Prediction
fofr/audio-to-waveform:6721fc036a12584b03461349a5c1ee15af5fe2961775c3eeac6640898d4c3ef8Input
- audio
- Video Player is loading.Current Time 00:00:000/Duration 00:00:000Loaded: 0%Stream Type LIVERemaining Time -00:00:0001x
- Chapters
- descriptions off, selected
- captions settings, opens captions settings dialog
- captions off, selected
This is a modal window.
Beginning of dialog window. Escape will cancel and close the window.
End of dialog window.
- bg_color
- #000000
- fg_alpha
- 0.75
- bar_count
- 100
- bar_width
- 0.4
- bars_color
- #ffffff
- caption_text
- 80s trancecore, driving rhythm section, ambient textures, boomwhackers, persian scale mode, tribute recording
{ "audio": "https://replicate.delivery/pbxt/J03sz7ye60eaijccxUfU5wc1W9vwgKIsU47QozjClDmi1bgB/20230613T093211825Z_80s_trancecore%2C_driving_rhythm.wav", "bg_color": "#000000", "fg_alpha": 0.75, "bar_count": 100, "bar_width": 0.4, "bars_color": "#ffffff", "caption_text": "80s trancecore, driving rhythm section, ambient textures, boomwhackers, persian scale mode, tribute recording" }
Install Replicate’s Node.js client library:npm install replicate
Import and set up the client:import Replicate from "replicate"; import fs from "node:fs"; const replicate = new Replicate({ auth: process.env.REPLICATE_API_TOKEN, });
Run fofr/audio-to-waveform using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
const output = await replicate.run( "fofr/audio-to-waveform:6721fc036a12584b03461349a5c1ee15af5fe2961775c3eeac6640898d4c3ef8", { input: { audio: "https://replicate.delivery/pbxt/J03sz7ye60eaijccxUfU5wc1W9vwgKIsU47QozjClDmi1bgB/20230613T093211825Z_80s_trancecore%2C_driving_rhythm.wav", bg_color: "#000000", fg_alpha: 0.75, bar_count: 100, bar_width: 0.4, bars_color: "#ffffff", caption_text: "80s trancecore, driving rhythm section, ambient textures, boomwhackers, persian scale mode, tribute recording" } } ); // To access the file URL: console.log(output.url()); //=> "http://example.com" // To write the file to disk: fs.writeFile("my-image.png", output);
To learn more, take a look at the guide on getting started with Node.js.
Install Replicate’s Python client library:pip install replicate
Import the client:import replicate
Run fofr/audio-to-waveform using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
output = replicate.run( "fofr/audio-to-waveform:6721fc036a12584b03461349a5c1ee15af5fe2961775c3eeac6640898d4c3ef8", input={ "audio": "https://replicate.delivery/pbxt/J03sz7ye60eaijccxUfU5wc1W9vwgKIsU47QozjClDmi1bgB/20230613T093211825Z_80s_trancecore%2C_driving_rhythm.wav", "bg_color": "#000000", "fg_alpha": 0.75, "bar_count": 100, "bar_width": 0.4, "bars_color": "#ffffff", "caption_text": "80s trancecore, driving rhythm section, ambient textures, boomwhackers, persian scale mode, tribute recording" } ) # To access the file URL: print(output.url()) #=> "http://example.com" # To write the file to disk: with open("my-image.png", "wb") as file: file.write(output.read())
To learn more, take a look at the guide on getting started with Python.
Run fofr/audio-to-waveform using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
curl -s -X POST \ -H "Authorization: Bearer $REPLICATE_API_TOKEN" \ -H "Content-Type: application/json" \ -H "Prefer: wait" \ -d $'{ "version": "fofr/audio-to-waveform:6721fc036a12584b03461349a5c1ee15af5fe2961775c3eeac6640898d4c3ef8", "input": { "audio": "https://replicate.delivery/pbxt/J03sz7ye60eaijccxUfU5wc1W9vwgKIsU47QozjClDmi1bgB/20230613T093211825Z_80s_trancecore%2C_driving_rhythm.wav", "bg_color": "#000000", "fg_alpha": 0.75, "bar_count": 100, "bar_width": 0.4, "bars_color": "#ffffff", "caption_text": "80s trancecore, driving rhythm section, ambient textures, boomwhackers, persian scale mode, tribute recording" } }' \ https://api.replicate.com/v1/predictions
To learn more, take a look at Replicate’s HTTP API reference docs.
Output
{ "completed_at": "2023-06-14T14:21:57.767734Z", "created_at": "2023-06-14T14:21:51.798761Z", "data_removed": false, "error": null, "id": "tojgjurb4jkiubngimsj2vfi7q", "input": { "audio": "https://replicate.delivery/pbxt/J03sz7ye60eaijccxUfU5wc1W9vwgKIsU47QozjClDmi1bgB/20230613T093211825Z_80s_trancecore%2C_driving_rhythm.wav", "bg_color": "#000000", "fg_alpha": 0.75, "bar_count": 100, "bar_width": 0.4, "bars_color": "#ffffff", "caption_text": "80s trancecore, driving rhythm section, ambient textures, boomwhackers, persian scale mode, tribute recording" }, "logs": "/usr/local/lib/python3.9/site-packages/matplotlib/axes/_axes.py:2229: RuntimeWarning: overflow encountered in scalar add\ndx = [convert(x0 + ddx) - x for ddx in dx]\n/usr/local/lib/python3.9/site-packages/matplotlib/axes/_axes.py:2229: RuntimeWarning: overflow encountered in scalar subtract\ndx = [convert(x0 + ddx) - x for ddx in dx]\n/usr/local/lib/python3.9/site-packages/matplotlib/patches.py:739: RuntimeWarning: overflow encountered in scalar add\ny1 = self.convert_yunits(self._y0 + self._height)\nffmpeg version 4.3.6-0+deb11u1 Copyright (c) 2000-2023 the FFmpeg developers\nbuilt with gcc 10 (Debian 10.2.1-6)\nconfiguration: --prefix=/usr --extra-version=0+deb11u1 --toolchain=hardened --libdir=/usr/lib/x86_64-linux-gnu --incdir=/usr/include/x86_64-linux-gnu --arch=amd64 --enable-gpl --disable-stripping --enable-avresample --disable-filter=resample --enable-gnutls --enable-ladspa --enable-libaom --enable-libass --enable-libbluray --enable-libbs2b --enable-libcaca --enable-libcdio --enable-libcodec2 --enable-libdav1d --enable-libflite --enable-libfontconfig --enable-libfreetype --enable-libfribidi --enable-libgme --enable-libgsm --enable-libjack --enable-libmp3lame --enable-libmysofa --enable-libopenjpeg --enable-libopenmpt --enable-libopus --enable-libpulse --enable-librabbitmq --enable-librsvg --enable-librubberband --enable-libshine --enable-libsnappy --enable-libsoxr --enable-libspeex --enable-libsrt --enable-libssh --enable-libtheora --enable-libtwolame --enable-libvidstab --enable-libvorbis --enable-libvpx --enable-libwavpack --enable-libwebp --enable-libx265 --enable-libxml2 --enable-libxvid --enable-libzmq --enable-libzvbi --enable-lv2 --enable-omx --enable-openal --enable-opencl --enable-opengl --enable-sdl2 --enable-pocketsphinx --enable-libmfx --enable-libdc1394 --enable-libdrm --enable-libiec61883 --enable-chromaprint --enable-frei0r --enable-libx264 --enable-shared\nlibavutil 56. 51.100 / 56. 51.100\nlibavcodec 58. 91.100 / 58. 91.100\nlibavformat 58. 45.100 / 58. 45.100\nlibavdevice 58. 10.100 / 58. 10.100\nlibavfilter 7. 85.100 / 7. 85.100\nlibavresample 4. 0. 0 / 4. 0. 0\nlibswscale 5. 7.100 / 5. 7.100\nlibswresample 3. 7.100 / 3. 7.100\nlibpostproc 55. 7.100 / 55. 7.100\nInput #0, png_pipe, from '/tmp/tmpgedemrhh.png':\nDuration: N/A, bitrate: N/A\nStream #0:0: Video: png, rgba(pc), 1000x200, 25 fps, 25 tbr, 25 tbn, 25 tbc\nGuessed Channel Layout for Input Stream #1.0 : mono\nInput #1, wav, from '/tmp/tmpu0zujhxp20230613T093211825Z_80s_trancecore,_driving_rhythm.wav':\nDuration: 00:00:08.00, bitrate: 512 kb/s\nStream #1:0: Audio: pcm_s16le ([1][0][0][0] / 0x0001), 32000 Hz, mono, s16, 512 kb/s\nStream mapping:\nStream #0:0 -> #0:0 (png (native) -> h264 (libx264))\nStream #1:0 -> #0:1 (pcm_s16le (native) -> aac (native))\nPress [q] to stop, [?] for help\n[libx264 @ 0x56435937b400] using cpu capabilities: MMX2 SSE2Fast SSSE3 SSE4.2 AVX FMA3 BMI2 AVX2\n[libx264 @ 0x56435937b400] profile High, level 3.0, 4:2:0, 8-bit\n[libx264 @ 0x56435937b400] 264 - core 160 r3011 cde9a93 - H.264/MPEG-4 AVC codec - Copyleft 2003-2020 - http://www.videolan.org/x264.html - options: cabac=1 ref=3 deblock=1:0:0 analyse=0x3:0x113 me=hex subme=7 psy=1 psy_rd=1.00:0.00 mixed_ref=1 me_range=16 chroma_me=1 trellis=1 8x8dct=1 cqm=0 deadzone=21,11 fast_pskip=1 chroma_qp_offset=-2 threads=6 lookahead_threads=1 sliced_threads=0 nr=0 decimate=1 interlaced=0 bluray_compat=0 constrained_intra=0 bframes=3 b_pyramid=2 b_adapt=1 b_bias=0 direct=1 weightb=1 open_gop=0 weightp=2 keyint=250 keyint_min=25 scenecut=40 intra_refresh=0 rc_lookahead=40 rc=crf mbtree=1 crf=23.0 qcomp=0.60 qpmin=0 qpmax=69 qpstep=4 ip_ratio=1.40 aq=1:1.00\nOutput #0, mp4, to '/tmp/tmpxwo3ej8k.mp4':\nMetadata:\nencoder : Lavf58.45.100\nStream #0:0: Video: h264 (libx264) (avc1 / 0x31637661), yuv420p(progressive), 1000x200, q=-1--1, 25 fps, 12800 tbn, 25 tbc\nMetadata:\nencoder : Lavc58.91.100 libx264\nSide data:\ncpb: bitrate max/min/avg: 0/0/0 buffer size: 0 vbv_delay: N/A\nStream #0:1: Audio: aac (LC) (mp4a / 0x6134706D), 32000 Hz, mono, fltp, 69 kb/s\nMetadata:\nencoder : Lavc58.91.100 aac\nframe= 194 fps=0.0 q=28.0 size= 0kB time=00:00:05.64 bitrate= 0.1kbits/s speed=11.3x\nframe= 200 fps=0.0 q=-1.0 Lsize= 102kB time=00:00:08.00 bitrate= 104.9kbits/s speed=13.4x\nvideo:28kB audio:68kB subtitle:0kB other streams:0kB global headers:0kB muxing overhead: 7.000663%\n[libx264 @ 0x56435937b400] frame I:1 Avg QP:25.51 size: 4105\n[libx264 @ 0x56435937b400] frame P:157 Avg QP:25.92 size: 134\n[libx264 @ 0x56435937b400] frame B:42 Avg QP:21.38 size: 61\n[libx264 @ 0x56435937b400] consecutive B-frames: 60.5% 33.0% 4.5% 2.0%\n[libx264 @ 0x56435937b400] mb I I16..4: 52.9% 19.7% 27.5%\n[libx264 @ 0x56435937b400] mb P I16..4: 1.3% 0.2% 0.2% P16..4: 1.5% 0.2% 0.1% 0.0% 0.0% skip:96.5%\n[libx264 @ 0x56435937b400] mb B I16..4: 0.4% 0.0% 0.0% B16..8: 2.8% 0.2% 0.0% direct: 0.0% skip:96.5% L0:44.7% L1:55.1% BI: 0.2%\n[libx264 @ 0x56435937b400] 8x8 transform intra:11.5% inter:29.5%\n[libx264 @ 0x56435937b400] coded y,uvDC,uvAC intra: 6.3% 0.0% 0.0% inter: 0.2% 0.0% 0.0%\n[libx264 @ 0x56435937b400] i16 v,h,dc,p: 96% 1% 3% 0%\n[libx264 @ 0x56435937b400] i8 v,h,dc,ddl,ddr,vr,hd,vl,hu: 24% 5% 71% 0% 0% 0% 0% 0% 0%\n[libx264 @ 0x56435937b400] i4 v,h,dc,ddl,ddr,vr,hd,vl,hu: 76% 5% 14% 0% 1% 1% 1% 1% 1%\n[libx264 @ 0x56435937b400] i8c dc,h,v,p: 100% 0% 0% 0%\n[libx264 @ 0x56435937b400] Weighted P-Frames: Y:0.0% UV:0.0%\n[libx264 @ 0x56435937b400] ref P L0: 89.0% 1.5% 6.7% 2.9%\n[libx264 @ 0x56435937b400] ref B L0: 85.7% 14.1% 0.2%\n[libx264 @ 0x56435937b400] ref B L1: 99.7% 0.3%\n[libx264 @ 0x56435937b400] kb/s:27.66\n[aac @ 0x56435937c9c0] Qavg: 125.702\nffmpeg version 4.3.6-0+deb11u1 Copyright (c) 2000-2023 the FFmpeg developers\nbuilt with gcc 10 (Debian 10.2.1-6)\nconfiguration: --prefix=/usr --extra-version=0+deb11u1 --toolchain=hardened --libdir=/usr/lib/x86_64-linux-gnu --incdir=/usr/include/x86_64-linux-gnu --arch=amd64 --enable-gpl --disable-stripping --enable-avresample --disable-filter=resample --enable-gnutls --enable-ladspa --enable-libaom --enable-libass --enable-libbluray --enable-libbs2b --enable-libcaca --enable-libcdio --enable-libcodec2 --enable-libdav1d --enable-libflite --enable-libfontconfig --enable-libfreetype --enable-libfribidi --enable-libgme --enable-libgsm --enable-libjack --enable-libmp3lame --enable-libmysofa --enable-libopenjpeg --enable-libopenmpt --enable-libopus --enable-libpulse --enable-librabbitmq --enable-librsvg --enable-librubberband --enable-libshine --enable-libsnappy --enable-libsoxr --enable-libspeex --enable-libsrt --enable-libssh --enable-libtheora --enable-libtwolame --enable-libvidstab --enable-libvorbis --enable-libvpx --enable-libwavpack --enable-libwebp --enable-libx265 --enable-libxml2 --enable-libxvid --enable-libzmq --enable-libzvbi --enable-lv2 --enable-omx --enable-openal --enable-opencl --enable-opengl --enable-sdl2 --enable-pocketsphinx --enable-libmfx --enable-libdc1394 --enable-libdrm --enable-libiec61883 --enable-chromaprint --enable-frei0r --enable-libx264 --enable-shared\nlibavutil 56. 51.100 / 56. 51.100\nlibavcodec 58. 91.100 / 58. 91.100\nlibavformat 58. 45.100 / 58. 45.100\nlibavdevice 58. 10.100 / 58. 10.100\nlibavfilter 7. 85.100 / 7. 85.100\nlibavresample 4. 0. 0 / 4. 0. 0\nlibswscale 5. 7.100 / 5. 7.100\nlibswresample 3. 7.100 / 3. 7.100\nlibpostproc 55. 7.100 / 55. 7.100\nInput #0, mov,mp4,m4a,3gp,3g2,mj2, from '/tmp/tmpxwo3ej8k.mp4':\nMetadata:\nmajor_brand : isom\nminor_version : 512\ncompatible_brands: isomiso2avc1mp41\nencoder : Lavf58.45.100\nDuration: 00:00:08.03, start: 0.000000, bitrate: 104 kb/s\nStream #0:0(und): Video: h264 (High) (avc1 / 0x31637661), yuv420p, 1000x200, 28 kb/s, 25 fps, 25 tbr, 12800 tbn, 50 tbc (default)\nMetadata:\nhandler_name : VideoHandler\nStream #0:1(und): Audio: aac (LC) (mp4a / 0x6134706D), 32000 Hz, mono, fltp, 69 kb/s (default)\nMetadata:\nhandler_name : SoundHandler\nStream mapping:\nStream #0:0 -> #0:0 (h264 (native) -> h264 (libx264))\nStream #0:1 -> #0:1 (aac (native) -> aac (native))\nPress [q] to stop, [?] for help\n[libx264 @ 0x56116ed0c300] using cpu capabilities: MMX2 SSE2Fast SSSE3 SSE4.2 AVX FMA3 BMI2 AVX2\n[libx264 @ 0x56116ed0c300] profile High, level 3.1, 4:2:0, 8-bit\n[libx264 @ 0x56116ed0c300] 264 - core 160 r3011 cde9a93 - H.264/MPEG-4 AVC codec - Copyleft 2003-2020 - http://www.videolan.org/x264.html - options: cabac=1 ref=3 deblock=1:0:0 analyse=0x3:0x113 me=hex subme=7 psy=1 psy_rd=1.00:0.00 mixed_ref=1 me_range=16 chroma_me=1 trellis=1 8x8dct=1 cqm=0 deadzone=21,11 fast_pskip=1 chroma_qp_offset=-2 threads=12 lookahead_threads=2 sliced_threads=0 nr=0 decimate=1 interlaced=0 bluray_compat=0 constrained_intra=0 bframes=3 b_pyramid=2 b_adapt=1 b_bias=0 direct=1 weightb=1 open_gop=0 weightp=2 keyint=250 keyint_min=25 scenecut=40 intra_refresh=0 rc_lookahead=40 rc=crf mbtree=1 crf=23.0 qcomp=0.60 qpmin=0 qpmax=69 qpstep=4 ip_ratio=1.40 aq=1:1.00\nOutput #0, mp4, to '/tmp/padded_waveform.mp4':\nMetadata:\nmajor_brand : isom\nminor_version : 512\ncompatible_brands: isomiso2avc1mp41\nencoder : Lavf58.45.100\nStream #0:0(und): Video: h264 (libx264) (avc1 / 0x31637661), yuv420p, 1000x666, q=-1--1, 25 fps, 12800 tbn, 25 tbc (default)\nMetadata:\nhandler_name : VideoHandler\nencoder : Lavc58.91.100 libx264\nSide data:\ncpb: bitrate max/min/avg: 0/0/0 buffer size: 0 vbv_delay: N/A\nStream #0:1(und): Audio: aac (LC) (mp4a / 0x6134706D), 32000 Hz, mono, fltp, 69 kb/s (default)\nMetadata:\nhandler_name : SoundHandler\nencoder : Lavc58.91.100 aac\nframe= 174 fps=0.0 q=28.0 size= 0kB time=00:00:07.23 bitrate= 0.1kbits/s speed=14.4x\nframe= 200 fps=0.0 q=-1.0 Lsize= 105kB time=00:00:08.00 bitrate= 107.8kbits/s speed= 12x\nvideo:30kB audio:69kB subtitle:0kB other streams:0kB global headers:0kB muxing overhead: 6.791316%\n[libx264 @ 0x56116ed0c300] frame I:1 Avg QP:14.36 size: 4613\n[libx264 @ 0x56116ed0c300] frame P:147 Avg QP:15.51 size: 142\n[libx264 @ 0x56116ed0c300] frame B:52 Avg QP:15.72 size: 79\n[libx264 @ 0x56116ed0c300] consecutive B-frames: 55.0% 27.0% 12.0% 6.0%\n[libx264 @ 0x56116ed0c300] mb I I16..4: 87.9% 3.2% 9.0%\n[libx264 @ 0x56116ed0c300] mb P I16..4: 0.4% 0.0% 0.1% P16..4: 0.1% 0.1% 0.0% 0.0% 0.0% skip:99.3%\n[libx264 @ 0x56116ed0c300] mb B I16..4: 0.2% 0.0% 0.0% B16..8: 0.7% 0.1% 0.0% direct: 0.0% skip:99.0% L0:45.7% L1:53.8% BI: 0.5%\n[libx264 @ 0x56116ed0c300] 8x8 transform intra:4.8% inter:34.7%\n[libx264 @ 0x56116ed0c300] coded y,uvDC,uvAC intra: 5.4% 0.0% 0.0% inter: 0.1% 0.0% 0.0%\n[libx264 @ 0x56116ed0c300] i16 v,h,dc,p: 96% 2% 2% 0%\n[libx264 @ 0x56116ed0c300] i8 v,h,dc,ddl,ddr,vr,hd,vl,hu: 31% 15% 53% 0% 0% 0% 0% 0% 0%\n[libx264 @ 0x56116ed0c300] i4 v,h,dc,ddl,ddr,vr,hd,vl,hu: 76% 5% 13% 0% 1% 1% 1% 1% 1%\n[libx264 @ 0x56116ed0c300] i8c dc,h,v,p: 100% 0% 0% 0%\n[libx264 @ 0x56116ed0c300] Weighted P-Frames: Y:0.0% UV:0.0%\n[libx264 @ 0x56116ed0c300] ref P L0: 73.5% 4.7% 13.7% 8.0%\n[libx264 @ 0x56116ed0c300] ref B L0: 89.3% 9.2% 1.5%\n[libx264 @ 0x56116ed0c300] ref B L1: 98.7% 1.3%\n[libx264 @ 0x56116ed0c300] kb/s:29.57\n[aac @ 0x56116ed1e800] Qavg: 125.140\nffmpeg version 4.3.6-0+deb11u1 Copyright (c) 2000-2023 the FFmpeg developers\nbuilt with gcc 10 (Debian 10.2.1-6)\nconfiguration: --prefix=/usr --extra-version=0+deb11u1 --toolchain=hardened --libdir=/usr/lib/x86_64-linux-gnu --incdir=/usr/include/x86_64-linux-gnu --arch=amd64 --enable-gpl --disable-stripping --enable-avresample --disable-filter=resample --enable-gnutls --enable-ladspa --enable-libaom --enable-libass --enable-libbluray --enable-libbs2b --enable-libcaca --enable-libcdio --enable-libcodec2 --enable-libdav1d --enable-libflite --enable-libfontconfig --enable-libfreetype --enable-libfribidi --enable-libgme --enable-libgsm --enable-libjack --enable-libmp3lame --enable-libmysofa --enable-libopenjpeg --enable-libopenmpt --enable-libopus --enable-libpulse --enable-librabbitmq --enable-librsvg --enable-librubberband --enable-libshine --enable-libsnappy --enable-libsoxr --enable-libspeex --enable-libsrt --enable-libssh --enable-libtheora --enable-libtwolame --enable-libvidstab --enable-libvorbis --enable-libvpx --enable-libwavpack --enable-libwebp --enable-libx265 --enable-libxml2 --enable-libxvid --enable-libzmq --enable-libzvbi --enable-lv2 --enable-omx --enable-openal --enable-opencl --enable-opengl --enable-sdl2 --enable-pocketsphinx --enable-libmfx --enable-libdc1394 --enable-libdrm --enable-libiec61883 --enable-chromaprint --enable-frei0r --enable-libx264 --enable-shared\nlibavutil 56. 51.100 / 56. 51.100\nlibavcodec 58. 91.100 / 58. 91.100\nlibavformat 58. 45.100 / 58. 45.100\nlibavdevice 58. 10.100 / 58. 10.100\nlibavfilter 7. 85.100 / 7. 85.100\nlibavresample 4. 0. 0 / 4. 0. 0\nlibswscale 5. 7.100 / 5. 7.100\nlibswresample 3. 7.100 / 3. 7.100\nlibpostproc 55. 7.100 / 55. 7.100\nInput #0, mov,mp4,m4a,3gp,3g2,mj2, from '/tmp/padded_waveform.mp4':\nMetadata:\nmajor_brand : isom\nminor_version : 512\ncompatible_brands: isomiso2avc1mp41\nencoder : Lavf58.45.100\nDuration: 00:00:08.03, start: 0.000000, bitrate: 107 kb/s\nStream #0:0(und): Video: h264 (High) (avc1 / 0x31637661), yuv420p, 1000x666, 30 kb/s, 25 fps, 25 tbr, 12800 tbn, 50 tbc (default)\nMetadata:\nhandler_name : VideoHandler\nStream #0:1(und): Audio: aac (LC) (mp4a / 0x6134706D), 32000 Hz, mono, fltp, 70 kb/s (default)\nMetadata:\nhandler_name : SoundHandler\nInput #1, png_pipe, from '/tmp/background.png':\nDuration: N/A, bitrate: N/A\nStream #1:0: Video: png, gray(pc), 980x447, 25 tbr, 25 tbn, 25 tbc\nStream mapping:\nStream #0:0 (h264) -> overlay:main (graph 0)\nStream #1:0 (png) -> overlay:overlay (graph 0)\noverlay (graph 0) -> Stream #0:0 (libx264)\nStream #0:1 -> #0:1 (aac (native) -> aac (native))\nPress [q] to stop, [?] for help\n[libx264 @ 0x557569ec6900] using cpu capabilities: MMX2 SSE2Fast SSSE3 SSE4.2 AVX FMA3 BMI2 AVX2\n[libx264 @ 0x557569ec6900] profile High, level 3.1, 4:2:0, 8-bit\n[libx264 @ 0x557569ec6900] 264 - core 160 r3011 cde9a93 - H.264/MPEG-4 AVC codec - Copyleft 2003-2020 - http://www.videolan.org/x264.html - options: cabac=1 ref=3 deblock=1:0:0 analyse=0x3:0x113 me=hex subme=7 psy=1 psy_rd=1.00:0.00 mixed_ref=1 me_range=16 chroma_me=1 trellis=1 8x8dct=1 cqm=0 deadzone=21,11 fast_pskip=1 chroma_qp_offset=-2 threads=12 lookahead_threads=2 sliced_threads=0 nr=0 decimate=1 interlaced=0 bluray_compat=0 constrained_intra=0 bframes=3 b_pyramid=2 b_adapt=1 b_bias=0 direct=1 weightb=1 open_gop=0 weightp=2 keyint=250 keyint_min=25 scenecut=40 intra_refresh=0 rc_lookahead=40 rc=crf mbtree=1 crf=23.0 qcomp=0.60 qpmin=0 qpmax=69 qpstep=4 ip_ratio=1.40 aq=1:1.00\nOutput #0, mp4, to '/tmp/final_waveform.mp4':\nMetadata:\nmajor_brand : isom\nminor_version : 512\ncompatible_brands: isomiso2avc1mp41\nencoder : Lavf58.45.100\nStream #0:0: Video: h264 (libx264) (avc1 / 0x31637661), yuv420p, 1000x666, q=-1--1, 25 fps, 12800 tbn, 25 tbc (default)\nMetadata:\nencoder : Lavc58.91.100 libx264\nSide data:\ncpb: bitrate max/min/avg: 0/0/0 buffer size: 0 vbv_delay: N/A\nStream #0:1(und): Audio: aac (LC) (mp4a / 0x6134706D), 32000 Hz, mono, fltp, 69 kb/s (default)\nMetadata:\nhandler_name : SoundHandler\nencoder : Lavc58.91.100 aac\nframe= 147 fps=0.0 q=28.0 size= 0kB time=00:00:06.14 bitrate= 0.1kbits/s speed=12.3x\nframe= 200 fps=0.0 q=-1.0 Lsize= 124kB time=00:00:08.00 bitrate= 126.8kbits/s speed=10.9x\nvideo:48kB audio:69kB subtitle:0kB other streams:0kB global headers:0kB muxing overhead: 5.740677%\n[libx264 @ 0x557569ec6900] frame I:1 Avg QP:20.26 size: 22748\n[libx264 @ 0x557569ec6900] frame P:149 Avg QP:15.58 size: 146\n[libx264 @ 0x557569ec6900] frame B:50 Avg QP:15.98 size: 89\n[libx264 @ 0x557569ec6900] consecutive B-frames: 56.0% 29.0% 9.0% 6.0%\n[libx264 @ 0x557569ec6900] mb I I16..4: 41.2% 34.0% 24.8%\n[libx264 @ 0x557569ec6900] mb P I16..4: 0.4% 0.0% 0.1% P16..4: 0.2% 0.1% 0.0% 0.0% 0.0% skip:99.3%\n[libx264 @ 0x557569ec6900] mb B I16..4: 0.3% 0.0% 0.0% B16..8: 0.9% 0.1% 0.0% direct: 0.0% skip:98.7% L0:43.4% L1:56.1% BI: 0.4%\n[libx264 @ 0x557569ec6900] 8x8 transform intra:20.4% inter:35.4%\n[libx264 @ 0x557569ec6900] coded y,uvDC,uvAC intra: 11.2% 0.0% 0.0% inter: 0.1% 0.0% 0.0%\n[libx264 @ 0x557569ec6900] i16 v,h,dc,p: 90% 7% 2% 0%\n[libx264 @ 0x557569ec6900] i8 v,h,dc,ddl,ddr,vr,hd,vl,hu: 73% 8% 18% 0% 0% 0% 0% 0% 0%\n[libx264 @ 0x557569ec6900] i4 v,h,dc,ddl,ddr,vr,hd,vl,hu: 66% 12% 9% 1% 2% 3% 3% 2% 2%\n[libx264 @ 0x557569ec6900] i8c dc,h,v,p: 100% 0% 0% 0%\n[libx264 @ 0x557569ec6900] Weighted P-Frames: Y:0.0% UV:0.0%\n[libx264 @ 0x557569ec6900] ref P L0: 77.3% 3.7% 12.5% 6.6%\n[libx264 @ 0x557569ec6900] ref B L0: 87.8% 7.9% 4.4%\n[libx264 @ 0x557569ec6900] kb/s:48.97\n[aac @ 0x557569ec5780] Qavg: 122.775", "metrics": { "predict_time": 3.313511, "total_time": 5.968973 }, "output": "https://replicate.delivery/pbxt/4YjofRyRlYzrQKRFje4mImvJHkfKNJIwYt8lyfqieNUogfdRE/final_waveform.mp4", "started_at": "2023-06-14T14:21:54.454223Z", "status": "succeeded", "urls": { "get": "https://api.replicate.com/v1/predictions/tojgjurb4jkiubngimsj2vfi7q", "cancel": "https://api.replicate.com/v1/predictions/tojgjurb4jkiubngimsj2vfi7q/cancel" }, "version": "6721fc036a12584b03461349a5c1ee15af5fe2961775c3eeac6640898d4c3ef8" }
Generated in/usr/local/lib/python3.9/site-packages/matplotlib/axes/_axes.py:2229: RuntimeWarning: overflow encountered in scalar add dx = [convert(x0 + ddx) - x for ddx in dx] /usr/local/lib/python3.9/site-packages/matplotlib/axes/_axes.py:2229: RuntimeWarning: overflow encountered in scalar subtract dx = [convert(x0 + ddx) - x for ddx in dx] /usr/local/lib/python3.9/site-packages/matplotlib/patches.py:739: RuntimeWarning: overflow encountered in scalar add y1 = self.convert_yunits(self._y0 + self._height) ffmpeg version 4.3.6-0+deb11u1 Copyright (c) 2000-2023 the FFmpeg developers built with gcc 10 (Debian 10.2.1-6) configuration: --prefix=/usr --extra-version=0+deb11u1 --toolchain=hardened --libdir=/usr/lib/x86_64-linux-gnu --incdir=/usr/include/x86_64-linux-gnu --arch=amd64 --enable-gpl --disable-stripping --enable-avresample --disable-filter=resample --enable-gnutls --enable-ladspa --enable-libaom --enable-libass --enable-libbluray --enable-libbs2b --enable-libcaca --enable-libcdio --enable-libcodec2 --enable-libdav1d --enable-libflite --enable-libfontconfig --enable-libfreetype --enable-libfribidi --enable-libgme --enable-libgsm --enable-libjack --enable-libmp3lame --enable-libmysofa --enable-libopenjpeg --enable-libopenmpt --enable-libopus --enable-libpulse --enable-librabbitmq --enable-librsvg --enable-librubberband --enable-libshine --enable-libsnappy --enable-libsoxr --enable-libspeex --enable-libsrt --enable-libssh --enable-libtheora --enable-libtwolame --enable-libvidstab --enable-libvorbis --enable-libvpx --enable-libwavpack --enable-libwebp --enable-libx265 --enable-libxml2 --enable-libxvid --enable-libzmq --enable-libzvbi --enable-lv2 --enable-omx --enable-openal --enable-opencl --enable-opengl --enable-sdl2 --enable-pocketsphinx --enable-libmfx --enable-libdc1394 --enable-libdrm --enable-libiec61883 --enable-chromaprint --enable-frei0r --enable-libx264 --enable-shared libavutil 56. 51.100 / 56. 51.100 libavcodec 58. 91.100 / 58. 91.100 libavformat 58. 45.100 / 58. 45.100 libavdevice 58. 10.100 / 58. 10.100 libavfilter 7. 85.100 / 7. 85.100 libavresample 4. 0. 0 / 4. 0. 0 libswscale 5. 7.100 / 5. 7.100 libswresample 3. 7.100 / 3. 7.100 libpostproc 55. 7.100 / 55. 7.100 Input #0, png_pipe, from '/tmp/tmpgedemrhh.png': Duration: N/A, bitrate: N/A Stream #0:0: Video: png, rgba(pc), 1000x200, 25 fps, 25 tbr, 25 tbn, 25 tbc Guessed Channel Layout for Input Stream #1.0 : mono Input #1, wav, from '/tmp/tmpu0zujhxp20230613T093211825Z_80s_trancecore,_driving_rhythm.wav': Duration: 00:00:08.00, bitrate: 512 kb/s Stream #1:0: Audio: pcm_s16le ([1][0][0][0] / 0x0001), 32000 Hz, mono, s16, 512 kb/s Stream mapping: Stream #0:0 -> #0:0 (png (native) -> h264 (libx264)) Stream #1:0 -> #0:1 (pcm_s16le (native) -> aac (native)) Press [q] to stop, [?] for help [libx264 @ 0x56435937b400] using cpu capabilities: MMX2 SSE2Fast SSSE3 SSE4.2 AVX FMA3 BMI2 AVX2 [libx264 @ 0x56435937b400] profile High, level 3.0, 4:2:0, 8-bit [libx264 @ 0x56435937b400] 264 - core 160 r3011 cde9a93 - H.264/MPEG-4 AVC codec - Copyleft 2003-2020 - http://www.videolan.org/x264.html - options: cabac=1 ref=3 deblock=1:0:0 analyse=0x3:0x113 me=hex subme=7 psy=1 psy_rd=1.00:0.00 mixed_ref=1 me_range=16 chroma_me=1 trellis=1 8x8dct=1 cqm=0 deadzone=21,11 fast_pskip=1 chroma_qp_offset=-2 threads=6 lookahead_threads=1 sliced_threads=0 nr=0 decimate=1 interlaced=0 bluray_compat=0 constrained_intra=0 bframes=3 b_pyramid=2 b_adapt=1 b_bias=0 direct=1 weightb=1 open_gop=0 weightp=2 keyint=250 keyint_min=25 scenecut=40 intra_refresh=0 rc_lookahead=40 rc=crf mbtree=1 crf=23.0 qcomp=0.60 qpmin=0 qpmax=69 qpstep=4 ip_ratio=1.40 aq=1:1.00 Output #0, mp4, to '/tmp/tmpxwo3ej8k.mp4': Metadata: encoder : Lavf58.45.100 Stream #0:0: Video: h264 (libx264) (avc1 / 0x31637661), yuv420p(progressive), 1000x200, q=-1--1, 25 fps, 12800 tbn, 25 tbc Metadata: encoder : Lavc58.91.100 libx264 Side data: cpb: bitrate max/min/avg: 0/0/0 buffer size: 0 vbv_delay: N/A Stream #0:1: Audio: aac (LC) (mp4a / 0x6134706D), 32000 Hz, mono, fltp, 69 kb/s Metadata: encoder : Lavc58.91.100 aac frame= 194 fps=0.0 q=28.0 size= 0kB time=00:00:05.64 bitrate= 0.1kbits/s speed=11.3x frame= 200 fps=0.0 q=-1.0 Lsize= 102kB time=00:00:08.00 bitrate= 104.9kbits/s speed=13.4x video:28kB audio:68kB subtitle:0kB other streams:0kB global headers:0kB muxing overhead: 7.000663% [libx264 @ 0x56435937b400] frame I:1 Avg QP:25.51 size: 4105 [libx264 @ 0x56435937b400] frame P:157 Avg QP:25.92 size: 134 [libx264 @ 0x56435937b400] frame B:42 Avg QP:21.38 size: 61 [libx264 @ 0x56435937b400] consecutive B-frames: 60.5% 33.0% 4.5% 2.0% [libx264 @ 0x56435937b400] mb I I16..4: 52.9% 19.7% 27.5% [libx264 @ 0x56435937b400] mb P I16..4: 1.3% 0.2% 0.2% P16..4: 1.5% 0.2% 0.1% 0.0% 0.0% skip:96.5% [libx264 @ 0x56435937b400] mb B I16..4: 0.4% 0.0% 0.0% B16..8: 2.8% 0.2% 0.0% direct: 0.0% skip:96.5% L0:44.7% L1:55.1% BI: 0.2% [libx264 @ 0x56435937b400] 8x8 transform intra:11.5% inter:29.5% [libx264 @ 0x56435937b400] coded y,uvDC,uvAC intra: 6.3% 0.0% 0.0% inter: 0.2% 0.0% 0.0% [libx264 @ 0x56435937b400] i16 v,h,dc,p: 96% 1% 3% 0% [libx264 @ 0x56435937b400] i8 v,h,dc,ddl,ddr,vr,hd,vl,hu: 24% 5% 71% 0% 0% 0% 0% 0% 0% [libx264 @ 0x56435937b400] i4 v,h,dc,ddl,ddr,vr,hd,vl,hu: 76% 5% 14% 0% 1% 1% 1% 1% 1% [libx264 @ 0x56435937b400] i8c dc,h,v,p: 100% 0% 0% 0% [libx264 @ 0x56435937b400] Weighted P-Frames: Y:0.0% UV:0.0% [libx264 @ 0x56435937b400] ref P L0: 89.0% 1.5% 6.7% 2.9% [libx264 @ 0x56435937b400] ref B L0: 85.7% 14.1% 0.2% [libx264 @ 0x56435937b400] ref B L1: 99.7% 0.3% [libx264 @ 0x56435937b400] kb/s:27.66 [aac @ 0x56435937c9c0] Qavg: 125.702 ffmpeg version 4.3.6-0+deb11u1 Copyright (c) 2000-2023 the FFmpeg developers built with gcc 10 (Debian 10.2.1-6) configuration: --prefix=/usr --extra-version=0+deb11u1 --toolchain=hardened --libdir=/usr/lib/x86_64-linux-gnu --incdir=/usr/include/x86_64-linux-gnu --arch=amd64 --enable-gpl --disable-stripping --enable-avresample --disable-filter=resample --enable-gnutls --enable-ladspa --enable-libaom --enable-libass --enable-libbluray --enable-libbs2b --enable-libcaca --enable-libcdio --enable-libcodec2 --enable-libdav1d --enable-libflite --enable-libfontconfig --enable-libfreetype --enable-libfribidi --enable-libgme --enable-libgsm --enable-libjack --enable-libmp3lame --enable-libmysofa --enable-libopenjpeg --enable-libopenmpt --enable-libopus --enable-libpulse --enable-librabbitmq --enable-librsvg --enable-librubberband --enable-libshine --enable-libsnappy --enable-libsoxr --enable-libspeex --enable-libsrt --enable-libssh --enable-libtheora --enable-libtwolame --enable-libvidstab --enable-libvorbis --enable-libvpx --enable-libwavpack --enable-libwebp --enable-libx265 --enable-libxml2 --enable-libxvid --enable-libzmq --enable-libzvbi --enable-lv2 --enable-omx --enable-openal --enable-opencl --enable-opengl --enable-sdl2 --enable-pocketsphinx --enable-libmfx --enable-libdc1394 --enable-libdrm --enable-libiec61883 --enable-chromaprint --enable-frei0r --enable-libx264 --enable-shared libavutil 56. 51.100 / 56. 51.100 libavcodec 58. 91.100 / 58. 91.100 libavformat 58. 45.100 / 58. 45.100 libavdevice 58. 10.100 / 58. 10.100 libavfilter 7. 85.100 / 7. 85.100 libavresample 4. 0. 0 / 4. 0. 0 libswscale 5. 7.100 / 5. 7.100 libswresample 3. 7.100 / 3. 7.100 libpostproc 55. 7.100 / 55. 7.100 Input #0, mov,mp4,m4a,3gp,3g2,mj2, from '/tmp/tmpxwo3ej8k.mp4': Metadata: major_brand : isom minor_version : 512 compatible_brands: isomiso2avc1mp41 encoder : Lavf58.45.100 Duration: 00:00:08.03, start: 0.000000, bitrate: 104 kb/s Stream #0:0(und): Video: h264 (High) (avc1 / 0x31637661), yuv420p, 1000x200, 28 kb/s, 25 fps, 25 tbr, 12800 tbn, 50 tbc (default) Metadata: handler_name : VideoHandler Stream #0:1(und): Audio: aac (LC) (mp4a / 0x6134706D), 32000 Hz, mono, fltp, 69 kb/s (default) Metadata: handler_name : SoundHandler Stream mapping: Stream #0:0 -> #0:0 (h264 (native) -> h264 (libx264)) Stream #0:1 -> #0:1 (aac (native) -> aac (native)) Press [q] to stop, [?] for help [libx264 @ 0x56116ed0c300] using cpu capabilities: MMX2 SSE2Fast SSSE3 SSE4.2 AVX FMA3 BMI2 AVX2 [libx264 @ 0x56116ed0c300] profile High, level 3.1, 4:2:0, 8-bit [libx264 @ 0x56116ed0c300] 264 - core 160 r3011 cde9a93 - H.264/MPEG-4 AVC codec - Copyleft 2003-2020 - http://www.videolan.org/x264.html - options: cabac=1 ref=3 deblock=1:0:0 analyse=0x3:0x113 me=hex subme=7 psy=1 psy_rd=1.00:0.00 mixed_ref=1 me_range=16 chroma_me=1 trellis=1 8x8dct=1 cqm=0 deadzone=21,11 fast_pskip=1 chroma_qp_offset=-2 threads=12 lookahead_threads=2 sliced_threads=0 nr=0 decimate=1 interlaced=0 bluray_compat=0 constrained_intra=0 bframes=3 b_pyramid=2 b_adapt=1 b_bias=0 direct=1 weightb=1 open_gop=0 weightp=2 keyint=250 keyint_min=25 scenecut=40 intra_refresh=0 rc_lookahead=40 rc=crf mbtree=1 crf=23.0 qcomp=0.60 qpmin=0 qpmax=69 qpstep=4 ip_ratio=1.40 aq=1:1.00 Output #0, mp4, to '/tmp/padded_waveform.mp4': Metadata: major_brand : isom minor_version : 512 compatible_brands: isomiso2avc1mp41 encoder : Lavf58.45.100 Stream #0:0(und): Video: h264 (libx264) (avc1 / 0x31637661), yuv420p, 1000x666, q=-1--1, 25 fps, 12800 tbn, 25 tbc (default) Metadata: handler_name : VideoHandler encoder : Lavc58.91.100 libx264 Side data: cpb: bitrate max/min/avg: 0/0/0 buffer size: 0 vbv_delay: N/A Stream #0:1(und): Audio: aac (LC) (mp4a / 0x6134706D), 32000 Hz, mono, fltp, 69 kb/s (default) Metadata: handler_name : SoundHandler encoder : Lavc58.91.100 aac frame= 174 fps=0.0 q=28.0 size= 0kB time=00:00:07.23 bitrate= 0.1kbits/s speed=14.4x frame= 200 fps=0.0 q=-1.0 Lsize= 105kB time=00:00:08.00 bitrate= 107.8kbits/s speed= 12x video:30kB audio:69kB subtitle:0kB other streams:0kB global headers:0kB muxing overhead: 6.791316% [libx264 @ 0x56116ed0c300] frame I:1 Avg QP:14.36 size: 4613 [libx264 @ 0x56116ed0c300] frame P:147 Avg QP:15.51 size: 142 [libx264 @ 0x56116ed0c300] frame B:52 Avg QP:15.72 size: 79 [libx264 @ 0x56116ed0c300] consecutive B-frames: 55.0% 27.0% 12.0% 6.0% [libx264 @ 0x56116ed0c300] mb I I16..4: 87.9% 3.2% 9.0% [libx264 @ 0x56116ed0c300] mb P I16..4: 0.4% 0.0% 0.1% P16..4: 0.1% 0.1% 0.0% 0.0% 0.0% skip:99.3% [libx264 @ 0x56116ed0c300] mb B I16..4: 0.2% 0.0% 0.0% B16..8: 0.7% 0.1% 0.0% direct: 0.0% skip:99.0% L0:45.7% L1:53.8% BI: 0.5% [libx264 @ 0x56116ed0c300] 8x8 transform intra:4.8% inter:34.7% [libx264 @ 0x56116ed0c300] coded y,uvDC,uvAC intra: 5.4% 0.0% 0.0% inter: 0.1% 0.0% 0.0% [libx264 @ 0x56116ed0c300] i16 v,h,dc,p: 96% 2% 2% 0% [libx264 @ 0x56116ed0c300] i8 v,h,dc,ddl,ddr,vr,hd,vl,hu: 31% 15% 53% 0% 0% 0% 0% 0% 0% [libx264 @ 0x56116ed0c300] i4 v,h,dc,ddl,ddr,vr,hd,vl,hu: 76% 5% 13% 0% 1% 1% 1% 1% 1% [libx264 @ 0x56116ed0c300] i8c dc,h,v,p: 100% 0% 0% 0% [libx264 @ 0x56116ed0c300] Weighted P-Frames: Y:0.0% UV:0.0% [libx264 @ 0x56116ed0c300] ref P L0: 73.5% 4.7% 13.7% 8.0% [libx264 @ 0x56116ed0c300] ref B L0: 89.3% 9.2% 1.5% [libx264 @ 0x56116ed0c300] ref B L1: 98.7% 1.3% [libx264 @ 0x56116ed0c300] kb/s:29.57 [aac @ 0x56116ed1e800] Qavg: 125.140 ffmpeg version 4.3.6-0+deb11u1 Copyright (c) 2000-2023 the FFmpeg developers built with gcc 10 (Debian 10.2.1-6) configuration: --prefix=/usr --extra-version=0+deb11u1 --toolchain=hardened --libdir=/usr/lib/x86_64-linux-gnu --incdir=/usr/include/x86_64-linux-gnu --arch=amd64 --enable-gpl --disable-stripping --enable-avresample --disable-filter=resample --enable-gnutls --enable-ladspa --enable-libaom --enable-libass --enable-libbluray --enable-libbs2b --enable-libcaca --enable-libcdio --enable-libcodec2 --enable-libdav1d --enable-libflite --enable-libfontconfig --enable-libfreetype --enable-libfribidi --enable-libgme --enable-libgsm --enable-libjack --enable-libmp3lame --enable-libmysofa --enable-libopenjpeg --enable-libopenmpt --enable-libopus --enable-libpulse --enable-librabbitmq --enable-librsvg --enable-librubberband --enable-libshine --enable-libsnappy --enable-libsoxr --enable-libspeex --enable-libsrt --enable-libssh --enable-libtheora --enable-libtwolame --enable-libvidstab --enable-libvorbis --enable-libvpx --enable-libwavpack --enable-libwebp --enable-libx265 --enable-libxml2 --enable-libxvid --enable-libzmq --enable-libzvbi --enable-lv2 --enable-omx --enable-openal --enable-opencl --enable-opengl --enable-sdl2 --enable-pocketsphinx --enable-libmfx --enable-libdc1394 --enable-libdrm --enable-libiec61883 --enable-chromaprint --enable-frei0r --enable-libx264 --enable-shared libavutil 56. 51.100 / 56. 51.100 libavcodec 58. 91.100 / 58. 91.100 libavformat 58. 45.100 / 58. 45.100 libavdevice 58. 10.100 / 58. 10.100 libavfilter 7. 85.100 / 7. 85.100 libavresample 4. 0. 0 / 4. 0. 0 libswscale 5. 7.100 / 5. 7.100 libswresample 3. 7.100 / 3. 7.100 libpostproc 55. 7.100 / 55. 7.100 Input #0, mov,mp4,m4a,3gp,3g2,mj2, from '/tmp/padded_waveform.mp4': Metadata: major_brand : isom minor_version : 512 compatible_brands: isomiso2avc1mp41 encoder : Lavf58.45.100 Duration: 00:00:08.03, start: 0.000000, bitrate: 107 kb/s Stream #0:0(und): Video: h264 (High) (avc1 / 0x31637661), yuv420p, 1000x666, 30 kb/s, 25 fps, 25 tbr, 12800 tbn, 50 tbc (default) Metadata: handler_name : VideoHandler Stream #0:1(und): Audio: aac (LC) (mp4a / 0x6134706D), 32000 Hz, mono, fltp, 70 kb/s (default) Metadata: handler_name : SoundHandler Input #1, png_pipe, from '/tmp/background.png': Duration: N/A, bitrate: N/A Stream #1:0: Video: png, gray(pc), 980x447, 25 tbr, 25 tbn, 25 tbc Stream mapping: Stream #0:0 (h264) -> overlay:main (graph 0) Stream #1:0 (png) -> overlay:overlay (graph 0) overlay (graph 0) -> Stream #0:0 (libx264) Stream #0:1 -> #0:1 (aac (native) -> aac (native)) Press [q] to stop, [?] for help [libx264 @ 0x557569ec6900] using cpu capabilities: MMX2 SSE2Fast SSSE3 SSE4.2 AVX FMA3 BMI2 AVX2 [libx264 @ 0x557569ec6900] profile High, level 3.1, 4:2:0, 8-bit [libx264 @ 0x557569ec6900] 264 - core 160 r3011 cde9a93 - H.264/MPEG-4 AVC codec - Copyleft 2003-2020 - http://www.videolan.org/x264.html - options: cabac=1 ref=3 deblock=1:0:0 analyse=0x3:0x113 me=hex subme=7 psy=1 psy_rd=1.00:0.00 mixed_ref=1 me_range=16 chroma_me=1 trellis=1 8x8dct=1 cqm=0 deadzone=21,11 fast_pskip=1 chroma_qp_offset=-2 threads=12 lookahead_threads=2 sliced_threads=0 nr=0 decimate=1 interlaced=0 bluray_compat=0 constrained_intra=0 bframes=3 b_pyramid=2 b_adapt=1 b_bias=0 direct=1 weightb=1 open_gop=0 weightp=2 keyint=250 keyint_min=25 scenecut=40 intra_refresh=0 rc_lookahead=40 rc=crf mbtree=1 crf=23.0 qcomp=0.60 qpmin=0 qpmax=69 qpstep=4 ip_ratio=1.40 aq=1:1.00 Output #0, mp4, to '/tmp/final_waveform.mp4': Metadata: major_brand : isom minor_version : 512 compatible_brands: isomiso2avc1mp41 encoder : Lavf58.45.100 Stream #0:0: Video: h264 (libx264) (avc1 / 0x31637661), yuv420p, 1000x666, q=-1--1, 25 fps, 12800 tbn, 25 tbc (default) Metadata: encoder : Lavc58.91.100 libx264 Side data: cpb: bitrate max/min/avg: 0/0/0 buffer size: 0 vbv_delay: N/A Stream #0:1(und): Audio: aac (LC) (mp4a / 0x6134706D), 32000 Hz, mono, fltp, 69 kb/s (default) Metadata: handler_name : SoundHandler encoder : Lavc58.91.100 aac frame= 147 fps=0.0 q=28.0 size= 0kB time=00:00:06.14 bitrate= 0.1kbits/s speed=12.3x frame= 200 fps=0.0 q=-1.0 Lsize= 124kB time=00:00:08.00 bitrate= 126.8kbits/s speed=10.9x video:48kB audio:69kB subtitle:0kB other streams:0kB global headers:0kB muxing overhead: 5.740677% [libx264 @ 0x557569ec6900] frame I:1 Avg QP:20.26 size: 22748 [libx264 @ 0x557569ec6900] frame P:149 Avg QP:15.58 size: 146 [libx264 @ 0x557569ec6900] frame B:50 Avg QP:15.98 size: 89 [libx264 @ 0x557569ec6900] consecutive B-frames: 56.0% 29.0% 9.0% 6.0% [libx264 @ 0x557569ec6900] mb I I16..4: 41.2% 34.0% 24.8% [libx264 @ 0x557569ec6900] mb P I16..4: 0.4% 0.0% 0.1% P16..4: 0.2% 0.1% 0.0% 0.0% 0.0% skip:99.3% [libx264 @ 0x557569ec6900] mb B I16..4: 0.3% 0.0% 0.0% B16..8: 0.9% 0.1% 0.0% direct: 0.0% skip:98.7% L0:43.4% L1:56.1% BI: 0.4% [libx264 @ 0x557569ec6900] 8x8 transform intra:20.4% inter:35.4% [libx264 @ 0x557569ec6900] coded y,uvDC,uvAC intra: 11.2% 0.0% 0.0% inter: 0.1% 0.0% 0.0% [libx264 @ 0x557569ec6900] i16 v,h,dc,p: 90% 7% 2% 0% [libx264 @ 0x557569ec6900] i8 v,h,dc,ddl,ddr,vr,hd,vl,hu: 73% 8% 18% 0% 0% 0% 0% 0% 0% [libx264 @ 0x557569ec6900] i4 v,h,dc,ddl,ddr,vr,hd,vl,hu: 66% 12% 9% 1% 2% 3% 3% 2% 2% [libx264 @ 0x557569ec6900] i8c dc,h,v,p: 100% 0% 0% 0% [libx264 @ 0x557569ec6900] Weighted P-Frames: Y:0.0% UV:0.0% [libx264 @ 0x557569ec6900] ref P L0: 77.3% 3.7% 12.5% 6.6% [libx264 @ 0x557569ec6900] ref B L0: 87.8% 7.9% 4.4% [libx264 @ 0x557569ec6900] kb/s:48.97 [aac @ 0x557569ec5780] Qavg: 122.775
Prediction
fofr/audio-to-waveform:89d3082a1055872f519b7f5b2b3777dfb8203cc8005da0f6cd8535d635022f54Input
- audio
- Video Player is loading.Current Time 00:00:000/Duration 00:00:000Loaded: 0%Stream Type LIVERemaining Time -00:00:0001x
- Chapters
- descriptions off, selected
- captions settings, opens captions settings dialog
- captions off, selected
This is a modal window.
Beginning of dialog window. Escape will cancel and close the window.
End of dialog window.
- bg_color
- #7142d3
- fg_alpha
- 0.75
- bar_count
- 100
- bar_width
- 0.4
- bars_color
- #f2df53
- caption_text
- There once was a string theorist, Whose dimensions were hard to resist, In the cosmos so vast, He unraveled the past, And the fabric of spacetime untwist.
{ "audio": "https://replicate.delivery/pbxt/J04PMpHOwcAk0NJgqa0K4llhe9etA0QX75axyGMYMY0u3cwm/20230429T215448415Z_There_once.wav", "bg_color": "#7142d3", "fg_alpha": 0.75, "bar_count": 100, "bar_width": 0.4, "bars_color": "#f2df53", "caption_text": "There once was a string theorist,\nWhose dimensions were hard to resist,\nIn the cosmos so vast,\nHe unraveled the past,\nAnd the fabric of spacetime untwist." }
Install Replicate’s Node.js client library:npm install replicate
Import and set up the client:import Replicate from "replicate"; import fs from "node:fs"; const replicate = new Replicate({ auth: process.env.REPLICATE_API_TOKEN, });
Run fofr/audio-to-waveform using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
const output = await replicate.run( "fofr/audio-to-waveform:89d3082a1055872f519b7f5b2b3777dfb8203cc8005da0f6cd8535d635022f54", { input: { audio: "https://replicate.delivery/pbxt/J04PMpHOwcAk0NJgqa0K4llhe9etA0QX75axyGMYMY0u3cwm/20230429T215448415Z_There_once.wav", bg_color: "#7142d3", fg_alpha: 0.75, bar_count: 100, bar_width: 0.4, bars_color: "#f2df53", caption_text: "There once was a string theorist,\nWhose dimensions were hard to resist,\nIn the cosmos so vast,\nHe unraveled the past,\nAnd the fabric of spacetime untwist." } } ); // To access the file URL: console.log(output.url()); //=> "http://example.com" // To write the file to disk: fs.writeFile("my-image.png", output);
To learn more, take a look at the guide on getting started with Node.js.
Install Replicate’s Python client library:pip install replicate
Import the client:import replicate
Run fofr/audio-to-waveform using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
output = replicate.run( "fofr/audio-to-waveform:89d3082a1055872f519b7f5b2b3777dfb8203cc8005da0f6cd8535d635022f54", input={ "audio": "https://replicate.delivery/pbxt/J04PMpHOwcAk0NJgqa0K4llhe9etA0QX75axyGMYMY0u3cwm/20230429T215448415Z_There_once.wav", "bg_color": "#7142d3", "fg_alpha": 0.75, "bar_count": 100, "bar_width": 0.4, "bars_color": "#f2df53", "caption_text": "There once was a string theorist,\nWhose dimensions were hard to resist,\nIn the cosmos so vast,\nHe unraveled the past,\nAnd the fabric of spacetime untwist." } ) # To access the file URL: print(output.url()) #=> "http://example.com" # To write the file to disk: with open("my-image.png", "wb") as file: file.write(output.read())
To learn more, take a look at the guide on getting started with Python.
Run fofr/audio-to-waveform using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
curl -s -X POST \ -H "Authorization: Bearer $REPLICATE_API_TOKEN" \ -H "Content-Type: application/json" \ -H "Prefer: wait" \ -d $'{ "version": "fofr/audio-to-waveform:89d3082a1055872f519b7f5b2b3777dfb8203cc8005da0f6cd8535d635022f54", "input": { "audio": "https://replicate.delivery/pbxt/J04PMpHOwcAk0NJgqa0K4llhe9etA0QX75axyGMYMY0u3cwm/20230429T215448415Z_There_once.wav", "bg_color": "#7142d3", "fg_alpha": 0.75, "bar_count": 100, "bar_width": 0.4, "bars_color": "#f2df53", "caption_text": "There once was a string theorist,\\nWhose dimensions were hard to resist,\\nIn the cosmos so vast,\\nHe unraveled the past,\\nAnd the fabric of spacetime untwist." } }' \ https://api.replicate.com/v1/predictions
To learn more, take a look at Replicate’s HTTP API reference docs.
Output
{ "completed_at": "2023-06-14T14:56:10.293178Z", "created_at": "2023-06-14T14:56:02.152666Z", "data_removed": false, "error": null, "id": "tvmuo2zbxs3rs5rgim5kixevxu", "input": { "audio": "https://replicate.delivery/pbxt/J04PMpHOwcAk0NJgqa0K4llhe9etA0QX75axyGMYMY0u3cwm/20230429T215448415Z_There_once.wav", "bg_color": "#7142d3", "fg_alpha": 0.75, "bar_count": 100, "bar_width": 0.4, "bars_color": "#f2df53", "caption_text": "There once was a string theorist,\nWhose dimensions were hard to resist,\nIn the cosmos so vast,\nHe unraveled the past,\nAnd the fabric of spacetime untwist." }, "logs": "ffmpeg version 4.3.6-0+deb11u1 Copyright (c) 2000-2023 the FFmpeg developers\nbuilt with gcc 10 (Debian 10.2.1-6)\nconfiguration: --prefix=/usr --extra-version=0+deb11u1 --toolchain=hardened --libdir=/usr/lib/x86_64-linux-gnu --incdir=/usr/include/x86_64-linux-gnu --arch=amd64 --enable-gpl --disable-stripping --enable-avresample --disable-filter=resample --enable-gnutls --enable-ladspa --enable-libaom --enable-libass --enable-libbluray --enable-libbs2b --enable-libcaca --enable-libcdio --enable-libcodec2 --enable-libdav1d --enable-libflite --enable-libfontconfig --enable-libfreetype --enable-libfribidi --enable-libgme --enable-libgsm --enable-libjack --enable-libmp3lame --enable-libmysofa --enable-libopenjpeg --enable-libopenmpt --enable-libopus --enable-libpulse --enable-librabbitmq --enable-librsvg --enable-librubberband --enable-libshine --enable-libsnappy --enable-libsoxr --enable-libspeex --enable-libsrt --enable-libssh --enable-libtheora --enable-libtwolame --enable-libvidstab --enable-libvorbis --enable-libvpx --enable-libwavpack --enable-libwebp --enable-libx265 --enable-libxml2 --enable-libxvid --enable-libzmq --enable-libzvbi --enable-lv2 --enable-omx --enable-openal --enable-opencl --enable-opengl --enable-sdl2 --enable-pocketsphinx --enable-libmfx --enable-libdc1394 --enable-libdrm --enable-libiec61883 --enable-chromaprint --enable-frei0r --enable-libx264 --enable-shared\nlibavutil 56. 51.100 / 56. 51.100\nlibavcodec 58. 91.100 / 58. 91.100\nlibavformat 58. 45.100 / 58. 45.100\nlibavdevice 58. 10.100 / 58. 10.100\nlibavfilter 7. 85.100 / 7. 85.100\nlibavresample 4. 0. 0 / 4. 0. 0\nlibswscale 5. 7.100 / 5. 7.100\nlibswresample 3. 7.100 / 3. 7.100\nlibpostproc 55. 7.100 / 55. 7.100\nInput #0, png_pipe, from '/tmp/tmp5l5a_11b.png':\nDuration: N/A, bitrate: N/A\nStream #0:0: Video: png, rgba(pc), 1000x200, 25 fps, 25 tbr, 25 tbn, 25 tbc\nGuessed Channel Layout for Input Stream #1.0 : mono\nInput #1, wav, from '/tmp/tmp9sssyoun20230429T215448415Z_There_once.wav':\nDuration: 00:00:13.28, bitrate: 768 kb/s\nStream #1:0: Audio: pcm_f32le ([3][0][0][0] / 0x0003), 24000 Hz, mono, flt, 768 kb/s\nStream mapping:\nStream #0:0 -> #0:0 (png (native) -> h264 (libx264))\nStream #1:0 -> #0:1 (pcm_f32le (native) -> aac (native))\nPress [q] to stop, [?] for help\n[libx264 @ 0x5588f771ce00] using cpu capabilities: MMX2 SSE2Fast SSSE3 SSE4.2 AVX FMA3 BMI2 AVX2\n[libx264 @ 0x5588f771ce00] profile High, level 3.0, 4:2:0, 8-bit\n[libx264 @ 0x5588f771ce00] 264 - core 160 r3011 cde9a93 - H.264/MPEG-4 AVC codec - Copyleft 2003-2020 - http://www.videolan.org/x264.html - options: cabac=1 ref=3 deblock=1:0:0 analyse=0x3:0x113 me=hex subme=7 psy=1 psy_rd=1.00:0.00 mixed_ref=1 me_range=16 chroma_me=1 trellis=1 8x8dct=1 cqm=0 deadzone=21,11 fast_pskip=1 chroma_qp_offset=-2 threads=6 lookahead_threads=1 sliced_threads=0 nr=0 decimate=1 interlaced=0 bluray_compat=0 constrained_intra=0 bframes=3 b_pyramid=2 b_adapt=1 b_bias=0 direct=1 weightb=1 open_gop=0 weightp=2 keyint=250 keyint_min=25 scenecut=40 intra_refresh=0 rc_lookahead=40 rc=crf mbtree=1 crf=23.0 qcomp=0.60 qpmin=0 qpmax=69 qpstep=4 ip_ratio=1.40 aq=1:1.00\nOutput #0, mp4, to '/tmp/tmp97oitmz2.mp4':\nMetadata:\nencoder : Lavf58.45.100\nStream #0:0: Video: h264 (libx264) (avc1 / 0x31637661), yuv420p(progressive), 1000x200, q=-1--1, 25 fps, 12800 tbn, 25 tbc\nMetadata:\nencoder : Lavc58.91.100 libx264\nSide data:\ncpb: bitrate max/min/avg: 0/0/0 buffer size: 0 vbv_delay: N/A\nStream #0:1: Audio: aac (LC) (mp4a / 0x6134706D), 24000 Hz, mono, fltp, 69 kb/s\nMetadata:\nencoder : Lavc58.91.100 aac\nframe= 142 fps=0.0 q=28.0 size= 0kB time=00:00:03.62 bitrate= 0.1kbits/s speed=7.19x\nframe= 289 fps=287 q=28.0 size= 0kB time=00:00:09.51 bitrate= 0.0kbits/s speed=9.45x\nframe= 332 fps=265 q=-1.0 Lsize= 156kB time=00:00:13.31 bitrate= 96.2kbits/s speed=10.6x\nvideo:46kB audio:102kB subtitle:0kB other streams:0kB global headers:0kB muxing overhead: 5.894178%\n[libx264 @ 0x5588f771ce00] frame I:2 Avg QP:14.23 size: 5778\n[libx264 @ 0x5588f771ce00] frame P:176 Avg QP:21.55 size: 151\n[libx264 @ 0x5588f771ce00] frame B:154 Avg QP:16.81 size: 54\n[libx264 @ 0x5588f771ce00] consecutive B-frames: 32.8% 14.5% 4.5% 48.2%\n[libx264 @ 0x5588f771ce00] mb I I16..4: 63.5% 16.7% 19.8%\n[libx264 @ 0x5588f771ce00] mb P I16..4: 1.0% 0.3% 0.1% P16..4: 2.7% 0.2% 0.1% 0.0% 0.0% skip:95.5%\n[libx264 @ 0x5588f771ce00] mb B I16..4: 0.3% 0.0% 0.0% B16..8: 1.9% 0.3% 0.0% direct: 0.0% skip:97.4% L0:39.1% L1:60.5% BI: 0.4%\n[libx264 @ 0x5588f771ce00] 8x8 transform intra:16.6% inter:40.6%\n[libx264 @ 0x5588f771ce00] coded y,uvDC,uvAC intra: 5.5% 18.0% 15.6% inter: 0.1% 0.2% 0.2%\n[libx264 @ 0x5588f771ce00] i16 v,h,dc,p: 94% 1% 5% 0%\n[libx264 @ 0x5588f771ce00] i8 v,h,dc,ddl,ddr,vr,hd,vl,hu: 25% 2% 72% 0% 0% 0% 0% 0% 0%\n[libx264 @ 0x5588f771ce00] i4 v,h,dc,ddl,ddr,vr,hd,vl,hu: 74% 6% 14% 0% 1% 1% 1% 1% 1%\n[libx264 @ 0x5588f771ce00] i8c dc,h,v,p: 39% 2% 59% 0%\n[libx264 @ 0x5588f771ce00] Weighted P-Frames: Y:0.0% UV:0.0%\n[libx264 @ 0x5588f771ce00] ref P L0: 92.9% 0.5% 5.1% 1.5%\n[libx264 @ 0x5588f771ce00] ref B L0: 82.8% 16.5% 0.6%\n[libx264 @ 0x5588f771ce00] ref B L1: 99.3% 0.7%\n[libx264 @ 0x5588f771ce00] kb/s:28.00\n[aac @ 0x5588f771e3c0] Qavg: 20273.711\nffmpeg version 4.3.6-0+deb11u1 Copyright (c) 2000-2023 the FFmpeg developers\nbuilt with gcc 10 (Debian 10.2.1-6)\nconfiguration: --prefix=/usr --extra-version=0+deb11u1 --toolchain=hardened --libdir=/usr/lib/x86_64-linux-gnu --incdir=/usr/include/x86_64-linux-gnu --arch=amd64 --enable-gpl --disable-stripping --enable-avresample --disable-filter=resample --enable-gnutls --enable-ladspa --enable-libaom --enable-libass --enable-libbluray --enable-libbs2b --enable-libcaca --enable-libcdio --enable-libcodec2 --enable-libdav1d --enable-libflite --enable-libfontconfig --enable-libfreetype --enable-libfribidi --enable-libgme --enable-libgsm --enable-libjack --enable-libmp3lame --enable-libmysofa --enable-libopenjpeg --enable-libopenmpt --enable-libopus --enable-libpulse --enable-librabbitmq --enable-librsvg --enable-librubberband --enable-libshine --enable-libsnappy --enable-libsoxr --enable-libspeex --enable-libsrt --enable-libssh --enable-libtheora --enable-libtwolame --enable-libvidstab --enable-libvorbis --enable-libvpx --enable-libwavpack --enable-libwebp --enable-libx265 --enable-libxml2 --enable-libxvid --enable-libzmq --enable-libzvbi --enable-lv2 --enable-omx --enable-openal --enable-opencl --enable-opengl --enable-sdl2 --enable-pocketsphinx --enable-libmfx --enable-libdc1394 --enable-libdrm --enable-libiec61883 --enable-chromaprint --enable-frei0r --enable-libx264 --enable-shared\nlibavutil 56. 51.100 / 56. 51.100\nlibavcodec 58. 91.100 / 58. 91.100\nlibavformat 58. 45.100 / 58. 45.100\nlibavdevice 58. 10.100 / 58. 10.100\nlibavfilter 7. 85.100 / 7. 85.100\nlibavresample 4. 0. 0 / 4. 0. 0\nlibswscale 5. 7.100 / 5. 7.100\nlibswresample 3. 7.100 / 3. 7.100\nlibpostproc 55. 7.100 / 55. 7.100\nInput #0, mov,mp4,m4a,3gp,3g2,mj2, from '/tmp/tmp97oitmz2.mp4':\nMetadata:\nmajor_brand : isom\nminor_version : 512\ncompatible_brands: isomiso2avc1mp41\nencoder : Lavf58.45.100\nDuration: 00:00:13.32, start: 0.000000, bitrate: 96 kb/s\nStream #0:0(und): Video: h264 (High) (avc1 / 0x31637661), yuv420p, 1000x200, 28 kb/s, 25 fps, 25 tbr, 12800 tbn, 50 tbc (default)\nMetadata:\nhandler_name : VideoHandler\nStream #0:1(und): Audio: aac (LC) (mp4a / 0x6134706D), 24000 Hz, mono, fltp, 62 kb/s (default)\nMetadata:\nhandler_name : SoundHandler\nStream mapping:\nStream #0:0 -> #0:0 (h264 (native) -> h264 (libx264))\nStream #0:1 -> #0:1 (aac (native) -> aac (native))\nPress [q] to stop, [?] for help\n[libx264 @ 0x55e07517fa40] using cpu capabilities: MMX2 SSE2Fast SSSE3 SSE4.2 AVX FMA3 BMI2 AVX2\n[libx264 @ 0x55e07517fa40] profile High, level 3.1, 4:2:0, 8-bit\n[libx264 @ 0x55e07517fa40] 264 - core 160 r3011 cde9a93 - H.264/MPEG-4 AVC codec - Copyleft 2003-2020 - http://www.videolan.org/x264.html - options: cabac=1 ref=3 deblock=1:0:0 analyse=0x3:0x113 me=hex subme=7 psy=1 psy_rd=1.00:0.00 mixed_ref=1 me_range=16 chroma_me=1 trellis=1 8x8dct=1 cqm=0 deadzone=21,11 fast_pskip=1 chroma_qp_offset=-2 threads=12 lookahead_threads=2 sliced_threads=0 nr=0 decimate=1 interlaced=0 bluray_compat=0 constrained_intra=0 bframes=3 b_pyramid=2 b_adapt=1 b_bias=0 direct=1 weightb=1 open_gop=0 weightp=2 keyint=250 keyint_min=25 scenecut=40 intra_refresh=0 rc_lookahead=40 rc=crf mbtree=1 crf=23.0 qcomp=0.60 qpmin=0 qpmax=69 qpstep=4 ip_ratio=1.40 aq=1:1.00\nOutput #0, mp4, to '/tmp/padded_waveform.mp4':\nMetadata:\n major_brand : isom\n minor_version : 512\ncompatible_brands: isomiso2avc1mp41\nencoder : Lavf58.45.100\nStream #0:0(und): Video: h264 (libx264) (avc1 / 0x31637661), yuv420p, 1000x666, q=-1--1, 25 fps, 12800 tbn, 25 tbc (default)\nMetadata:\nhandler_name : VideoHandler\nencoder : Lavc58.91.100 libx264\nSide data:\ncpb: bitrate max/min/avg: 0/0/0 buffer size: 0 vbv_delay: N/A\nStream #0:1(und): Audio: aac (LC) (mp4a / 0x6134706D), 24000 Hz, mono, fltp, 69 kb/s (default)\nMetadata:\nhandler_name : SoundHandler\nencoder : Lavc58.91.100 aac\nframe= 103 fps=0.0 q=28.0 size= 0kB time=00:00:04.39 bitrate= 0.1kbits/s speed=8.79x\nframe= 268 fps=267 q=28.0 size= 0kB time=00:00:10.96 bitrate= 0.0kbits/s speed=10.9x\nframe= 332 fps=261 q=-1.0 Lsize= 157kB time=00:00:13.31 bitrate= 96.6kbits/s speed=10.5x\nvideo:48kB audio:100kB subtitle:0kB other streams:0kB global headers:0kB muxing overhead: 6.233152%\n[libx264 @ 0x55e07517fa40] frame I:2 Avg QP:10.29 size: 6298\n[libx264 @ 0x55e07517fa40] frame P:123 Avg QP:13.06 size: 160\n[libx264 @ 0x55e07517fa40] frame B:207 Avg QP:14.84 size: 77\n[libx264 @ 0x55e07517fa40] consecutive B-frames: 9.9% 18.1% 8.1% 63.9%\n[libx264 @ 0x55e07517fa40] mb I I16..4: 93.0% 0.3% 6.7%\n[libx264 @ 0x55e07517fa40] mb P I16..4: 0.4% 0.0% 0.0% P16..4: 0.2% 0.1% 0.0% 0.0% 0.0% skip:99.2%\n[libx264 @ 0x55e07517fa40] mb B I16..4: 0.2% 0.0% 0.0% B16..8: 0.5% 0.1% 0.0% direct: 0.0% skip:99.2% L0:44.7% L1:54.6% BI: 0.6%\n[libx264 @ 0x55e07517fa40] 8x8 transform intra:1.9% inter:42.1%\n[libx264 @ 0x55e07517fa40] coded y,uvDC,uvAC intra: 3.7% 10.9% 8.8% inter: 0.0% 0.1% 0.1%\n[libx264 @ 0x55e07517fa40] i16 v,h,dc,p: 97% 1% 2% 0%\n[libx264 @ 0x55e07517fa40] i8 v,h,dc,ddl,ddr,vr,hd,vl,hu: 12% 7% 79% 1% 0% 0% 0% 0% 0%\n[libx264 @ 0x55e07517fa40] i4 v,h,dc,ddl,ddr,vr,hd,vl,hu: 74% 7% 14% 0% 1% 1% 1% 1% 0%\n[libx264 @ 0x55e07517fa40] i8c dc,h,v,p: 63% 3% 34% 0%\n[libx264 @ 0x55e07517fa40] Weighted P-Frames: Y:0.0% UV:0.0%\n[libx264 @ 0x55e07517fa40] ref P L0: 71.8% 2.2% 20.6% 5.4%\n[libx264 @ 0x55e07517fa40] ref B L0: 86.6% 12.1% 1.3%\n[libx264 @ 0x55e07517fa40] ref B L1: 99.0% 1.0%\n[libx264 @ 0x55e07517fa40] kb/s:29.06\n[aac @ 0x55e0751810c0] Qavg: 21293.207\nffmpeg version 4.3.6-0+deb11u1 Copyright (c) 2000-2023 the FFmpeg developers\nbuilt with gcc 10 (Debian 10.2.1-6)\nconfiguration: --prefix=/usr --extra-version=0+deb11u1 --toolchain=hardened --libdir=/usr/lib/x86_64-linux-gnu --incdir=/usr/include/x86_64-linux-gnu --arch=amd64 --enable-gpl --disable-stripping --enable-avresample --disable-filter=resample --enable-gnutls --enable-ladspa --enable-libaom --enable-libass --enable-libbluray --enable-libbs2b --enable-libcaca --enable-libcdio --enable-libcodec2 --enable-libdav1d --enable-libflite --enable-libfontconfig --enable-libfreetype --enable-libfribidi --enable-libgme --enable-libgsm --enable-libjack --enable-libmp3lame --enable-libmysofa --enable-libopenjpeg --enable-libopenmpt --enable-libopus --enable-libpulse --enable-librabbitmq --enable-librsvg --enable-librubberband --enable-libshine --enable-libsnappy --enable-libsoxr --enable-libspeex --enable-libsrt --enable-libssh --enable-libtheora --enable-libtwolame --enable-libvidstab --enable-libvorbis --enable-libvpx --enable-libwavpack --enable-libwebp --enable-libx265 --enable-libxml2 --enable-libxvid --enable-libzmq --enable-libzvbi --enable-lv2 --enable-omx --enable-openal --enable-opencl --enable-opengl --enable-sdl2 --enable-pocketsphinx --enable-libmfx --enable-libdc1394 --enable-libdrm --enable-libiec61883 --enable-chromaprint --enable-frei0r --enable-libx264 --enable-shared\nlibavutil 56. 51.100 / 56. 51.100\nlibavcodec 58. 91.100 / 58. 91.100\nlibavformat 58. 45.100 / 58. 45.100\nlibavdevice 58. 10.100 / 58. 10.100\nlibavfilter 7. 85.100 / 7. 85.100\nlibavresample 4. 0. 0 / 4. 0. 0\nlibswscale 5. 7.100 / 5. 7.100\nlibswresample 3. 7.100 / 3. 7.100\nlibpostproc 55. 7.100 / 55. 7.100\nInput #0, mov,mp4,m4a,3gp,3g2,mj2, from '/tmp/padded_waveform.mp4':\nMetadata:\nmajor_brand : isom\nminor_version : 512\ncompatible_brands: isomiso2avc1mp41\nencoder : Lavf58.45.100\nDuration: 00:00:13.36, start: 0.000000, bitrate: 96 kb/s\nStream #0:0(und): Video: h264 (High) (avc1 / 0x31637661), yuv420p, 1000x666, 29 kb/s, 25 fps, 25 tbr, 12800 tbn, 50 tbc (default)\nMetadata:\nhandler_name : VideoHandler\nStream #0:1(und): Audio: aac (LC) (mp4a / 0x6134706D), 24000 Hz, mono, fltp, 61 kb/s (default)\nMetadata:\nhandler_name : SoundHandler\nInput #1, png_pipe, from '/tmp/background.png':\nDuration: N/A, bitrate: N/A\nStream #1:0: Video: png, rgb48be(pc), 980x447, 25 tbr, 25 tbn, 25 tbc\nStream mapping:\nStream #0:0 (h264) -> overlay:main (graph 0)\nStream #1:0 (png) -> overlay:overlay (graph 0)\noverlay (graph 0) -> Stream #0:0 (libx264)\nStream #0:1 -> #0:1 (aac (native) -> aac (native))\nPress [q] to stop, [?] for help\n[libx264 @ 0x55bf473c4880] using cpu capabilities: MMX2 SSE2Fast SSSE3 SSE4.2 AVX FMA3 BMI2 AVX2\n[libx264 @ 0x55bf473c4880] profile High, level 3.1, 4:2:0, 8-bit\n[libx264 @ 0x55bf473c4880] 264 - core 160 r3011 cde9a93 - H.264/MPEG-4 AVC codec - Copyleft 2003-2020 - http://www.videolan.org/x264.html - options: cabac=1 ref=3 deblock=1:0:0 analyse=0x3:0x113 me=hex subme=7 psy=1 psy_rd=1.00:0.00 mixed_ref=1 me_range=16 chroma_me=1 trellis=1 8x8dct=1 cqm=0 deadzone=21,11 fast_pskip=1 chroma_qp_offset=-2 threads=12 lookahead_threads=2 sliced_threads=0 nr=0 decimate=1 interlaced=0 bluray_compat=0 constrained_intra=0 bframes=3 b_pyramid=2 b_adapt=1 b_bias=0 direct=1 weightb=1 open_gop=0 weightp=2 keyint=250 keyint_min=25 scenecut=40 intra_refresh=0 rc_lookahead=40 rc=crf mbtree=1 crf=23.0 qcomp=0.60 qpmin=0 qpmax=69 qpstep=4 ip_ratio=1.40 aq=1:1.00\nOutput #0, mp4, to '/tmp/final_waveform.mp4':\nMetadata:\nmajor_brand : isom\nminor_version : 512\ncompatible_brands: isomiso2avc1mp41\nencoder : Lavf58.45.100\nStream #0:0: Video: h264 (libx264) (avc1 / 0x31637661), yuv420p, 1000x666, q=-1--1, 25 fps, 12800 tbn, 25 tbc (default)\nMetadata:\nencoder : Lavc58.91.100 libx264\nSide data:\ncpb: bitrate max/min/avg: 0/0/0 buffer size: 0 vbv_delay: N/A\nStream #0:1(und): Audio: aac (LC) (mp4a / 0x6134706D), 24000 Hz, mono, fltp, 69 kb/s (default)\nMetadata:\nhandler_name : SoundHandler\nencoder : Lavc58.91.100 aac\nframe= 94 fps=0.0 q=28.0 size= 0kB time=00:00:04.01 bitrate= 0.1kbits/s speed=7.94x\nframe= 215 fps=214 q=28.0 size= 0kB time=00:00:08.83 bitrate= 0.0kbits/s speed=8.78x\nframe= 326 fps=216 q=28.0 size= 0kB time=00:00:13.22 bitrate= 0.0kbits/s speed=8.78x\nframe= 332 fps=198 q=-1.0 Lsize= 237kB time=00:00:13.31 bitrate= 146.0kbits/s speed=7.96x\nvideo:127kB audio:101kB subtitle:0kB other streams:0kB global headers:0kB muxing overhead: 4.019545%\n[libx264 @ 0x55bf473c4880] frame I:2 Avg QP:16.36 size: 45684\n[libx264 @ 0x55bf473c4880] frame P:124 Avg QP:18.79 size: 178\n[libx264 @ 0x55bf473c4880] frame B:206 Avg QP:20.74 size: 76\n[libx264 @ 0x55bf473c4880] consecutive B-frames: 10.8% 15.7% 10.8% 62.7%\n[libx264 @ 0x55bf473c4880] mb I I16..4: 62.6% 9.8% 27.7%\n[libx264 @ 0x55bf473c4880] mb P I16..4: 0.3% 0.1% 0.0% P16..4: 0.4% 0.1% 0.0% 0.0% 0.0% skip:99.0%\n[libx264 @ 0x55bf473c4880] mb B I16..4: 0.1% 0.0% 0.0% B16..8: 0.6% 0.1% 0.0% direct: 0.0% skip:99.2% L0:40.0% L1:59.4% BI: 0.6%\n[libx264 @ 0x55bf473c4880] 8x8 transform intra:13.4% inter:38.1%\n[libx264 @ 0x55bf473c4880] coded y,uvDC,uvAC intra: 13.6% 27.5% 24.9% inter: 0.0% 0.1% 0.1%\n[libx264 @ 0x55bf473c4880] i16 v,h,dc,p: 91% 6% 3% 0%\n[libx264 @ 0x55bf473c4880] i8 v,h,dc,ddl,ddr,vr,hd,vl,hu: 27% 8% 65% 0% 0% 0% 0% 0% 0%\n[libx264 @ 0x55bf473c4880] i4 v,h,dc,ddl,ddr,vr,hd,vl,hu: 57% 14% 12% 2% 3% 4% 4% 3% 3%\n[libx264 @ 0x55bf473c4880] i8c dc,h,v,p: 51% 10% 37% 2%\n[libx264 @ 0x55bf473c4880] Weighted P-Frames: Y:0.0% UV:0.0%\n[libx264 @ 0x55bf473c4880] ref P L0: 81.9% 1.8% 13.6% 2.7%\n[libx264 @ 0x55bf473c4880] ref B L0: 87.4% 10.6% 1.9%\n[libx264 @ 0x55bf473c4880] ref B L1: 98.5% 1.5%\n[libx264 @ 0x55bf473c4880] kb/s:77.69\n[aac @ 0x55bf473c5e80] Qavg: 21020.768", "metrics": { "predict_time": 6.227028, "total_time": 8.140512 }, "output": "https://replicate.delivery/pbxt/28nxWKpL8bZ2F5L7yfTPweqhZs2T63BTb1OC4ZhKKUhKc4FRA/final_waveform.mp4", "started_at": "2023-06-14T14:56:04.066150Z", "status": "succeeded", "urls": { "get": "https://api.replicate.com/v1/predictions/tvmuo2zbxs3rs5rgim5kixevxu", "cancel": "https://api.replicate.com/v1/predictions/tvmuo2zbxs3rs5rgim5kixevxu/cancel" }, "version": "89d3082a1055872f519b7f5b2b3777dfb8203cc8005da0f6cd8535d635022f54" }
Generated inffmpeg version 4.3.6-0+deb11u1 Copyright (c) 2000-2023 the FFmpeg developers built with gcc 10 (Debian 10.2.1-6) configuration: --prefix=/usr --extra-version=0+deb11u1 --toolchain=hardened --libdir=/usr/lib/x86_64-linux-gnu --incdir=/usr/include/x86_64-linux-gnu --arch=amd64 --enable-gpl --disable-stripping --enable-avresample --disable-filter=resample --enable-gnutls --enable-ladspa --enable-libaom --enable-libass --enable-libbluray --enable-libbs2b --enable-libcaca --enable-libcdio --enable-libcodec2 --enable-libdav1d --enable-libflite --enable-libfontconfig --enable-libfreetype --enable-libfribidi --enable-libgme --enable-libgsm --enable-libjack --enable-libmp3lame --enable-libmysofa --enable-libopenjpeg --enable-libopenmpt --enable-libopus --enable-libpulse --enable-librabbitmq --enable-librsvg --enable-librubberband --enable-libshine --enable-libsnappy --enable-libsoxr --enable-libspeex --enable-libsrt --enable-libssh --enable-libtheora --enable-libtwolame --enable-libvidstab --enable-libvorbis --enable-libvpx --enable-libwavpack --enable-libwebp --enable-libx265 --enable-libxml2 --enable-libxvid --enable-libzmq --enable-libzvbi --enable-lv2 --enable-omx --enable-openal --enable-opencl --enable-opengl --enable-sdl2 --enable-pocketsphinx --enable-libmfx --enable-libdc1394 --enable-libdrm --enable-libiec61883 --enable-chromaprint --enable-frei0r --enable-libx264 --enable-shared libavutil 56. 51.100 / 56. 51.100 libavcodec 58. 91.100 / 58. 91.100 libavformat 58. 45.100 / 58. 45.100 libavdevice 58. 10.100 / 58. 10.100 libavfilter 7. 85.100 / 7. 85.100 libavresample 4. 0. 0 / 4. 0. 0 libswscale 5. 7.100 / 5. 7.100 libswresample 3. 7.100 / 3. 7.100 libpostproc 55. 7.100 / 55. 7.100 Input #0, png_pipe, from '/tmp/tmp5l5a_11b.png': Duration: N/A, bitrate: N/A Stream #0:0: Video: png, rgba(pc), 1000x200, 25 fps, 25 tbr, 25 tbn, 25 tbc Guessed Channel Layout for Input Stream #1.0 : mono Input #1, wav, from '/tmp/tmp9sssyoun20230429T215448415Z_There_once.wav': Duration: 00:00:13.28, bitrate: 768 kb/s Stream #1:0: Audio: pcm_f32le ([3][0][0][0] / 0x0003), 24000 Hz, mono, flt, 768 kb/s Stream mapping: Stream #0:0 -> #0:0 (png (native) -> h264 (libx264)) Stream #1:0 -> #0:1 (pcm_f32le (native) -> aac (native)) Press [q] to stop, [?] for help [libx264 @ 0x5588f771ce00] using cpu capabilities: MMX2 SSE2Fast SSSE3 SSE4.2 AVX FMA3 BMI2 AVX2 [libx264 @ 0x5588f771ce00] profile High, level 3.0, 4:2:0, 8-bit [libx264 @ 0x5588f771ce00] 264 - core 160 r3011 cde9a93 - H.264/MPEG-4 AVC codec - Copyleft 2003-2020 - http://www.videolan.org/x264.html - options: cabac=1 ref=3 deblock=1:0:0 analyse=0x3:0x113 me=hex subme=7 psy=1 psy_rd=1.00:0.00 mixed_ref=1 me_range=16 chroma_me=1 trellis=1 8x8dct=1 cqm=0 deadzone=21,11 fast_pskip=1 chroma_qp_offset=-2 threads=6 lookahead_threads=1 sliced_threads=0 nr=0 decimate=1 interlaced=0 bluray_compat=0 constrained_intra=0 bframes=3 b_pyramid=2 b_adapt=1 b_bias=0 direct=1 weightb=1 open_gop=0 weightp=2 keyint=250 keyint_min=25 scenecut=40 intra_refresh=0 rc_lookahead=40 rc=crf mbtree=1 crf=23.0 qcomp=0.60 qpmin=0 qpmax=69 qpstep=4 ip_ratio=1.40 aq=1:1.00 Output #0, mp4, to '/tmp/tmp97oitmz2.mp4': Metadata: encoder : Lavf58.45.100 Stream #0:0: Video: h264 (libx264) (avc1 / 0x31637661), yuv420p(progressive), 1000x200, q=-1--1, 25 fps, 12800 tbn, 25 tbc Metadata: encoder : Lavc58.91.100 libx264 Side data: cpb: bitrate max/min/avg: 0/0/0 buffer size: 0 vbv_delay: N/A Stream #0:1: Audio: aac (LC) (mp4a / 0x6134706D), 24000 Hz, mono, fltp, 69 kb/s Metadata: encoder : Lavc58.91.100 aac frame= 142 fps=0.0 q=28.0 size= 0kB time=00:00:03.62 bitrate= 0.1kbits/s speed=7.19x frame= 289 fps=287 q=28.0 size= 0kB time=00:00:09.51 bitrate= 0.0kbits/s speed=9.45x frame= 332 fps=265 q=-1.0 Lsize= 156kB time=00:00:13.31 bitrate= 96.2kbits/s speed=10.6x video:46kB audio:102kB subtitle:0kB other streams:0kB global headers:0kB muxing overhead: 5.894178% [libx264 @ 0x5588f771ce00] frame I:2 Avg QP:14.23 size: 5778 [libx264 @ 0x5588f771ce00] frame P:176 Avg QP:21.55 size: 151 [libx264 @ 0x5588f771ce00] frame B:154 Avg QP:16.81 size: 54 [libx264 @ 0x5588f771ce00] consecutive B-frames: 32.8% 14.5% 4.5% 48.2% [libx264 @ 0x5588f771ce00] mb I I16..4: 63.5% 16.7% 19.8% [libx264 @ 0x5588f771ce00] mb P I16..4: 1.0% 0.3% 0.1% P16..4: 2.7% 0.2% 0.1% 0.0% 0.0% skip:95.5% [libx264 @ 0x5588f771ce00] mb B I16..4: 0.3% 0.0% 0.0% B16..8: 1.9% 0.3% 0.0% direct: 0.0% skip:97.4% L0:39.1% L1:60.5% BI: 0.4% [libx264 @ 0x5588f771ce00] 8x8 transform intra:16.6% inter:40.6% [libx264 @ 0x5588f771ce00] coded y,uvDC,uvAC intra: 5.5% 18.0% 15.6% inter: 0.1% 0.2% 0.2% [libx264 @ 0x5588f771ce00] i16 v,h,dc,p: 94% 1% 5% 0% [libx264 @ 0x5588f771ce00] i8 v,h,dc,ddl,ddr,vr,hd,vl,hu: 25% 2% 72% 0% 0% 0% 0% 0% 0% [libx264 @ 0x5588f771ce00] i4 v,h,dc,ddl,ddr,vr,hd,vl,hu: 74% 6% 14% 0% 1% 1% 1% 1% 1% [libx264 @ 0x5588f771ce00] i8c dc,h,v,p: 39% 2% 59% 0% [libx264 @ 0x5588f771ce00] Weighted P-Frames: Y:0.0% UV:0.0% [libx264 @ 0x5588f771ce00] ref P L0: 92.9% 0.5% 5.1% 1.5% [libx264 @ 0x5588f771ce00] ref B L0: 82.8% 16.5% 0.6% [libx264 @ 0x5588f771ce00] ref B L1: 99.3% 0.7% [libx264 @ 0x5588f771ce00] kb/s:28.00 [aac @ 0x5588f771e3c0] Qavg: 20273.711 ffmpeg version 4.3.6-0+deb11u1 Copyright (c) 2000-2023 the FFmpeg developers built with gcc 10 (Debian 10.2.1-6) configuration: --prefix=/usr --extra-version=0+deb11u1 --toolchain=hardened --libdir=/usr/lib/x86_64-linux-gnu --incdir=/usr/include/x86_64-linux-gnu --arch=amd64 --enable-gpl --disable-stripping --enable-avresample --disable-filter=resample --enable-gnutls --enable-ladspa --enable-libaom --enable-libass --enable-libbluray --enable-libbs2b --enable-libcaca --enable-libcdio --enable-libcodec2 --enable-libdav1d --enable-libflite --enable-libfontconfig --enable-libfreetype --enable-libfribidi --enable-libgme --enable-libgsm --enable-libjack --enable-libmp3lame --enable-libmysofa --enable-libopenjpeg --enable-libopenmpt --enable-libopus --enable-libpulse --enable-librabbitmq --enable-librsvg --enable-librubberband --enable-libshine --enable-libsnappy --enable-libsoxr --enable-libspeex --enable-libsrt --enable-libssh --enable-libtheora --enable-libtwolame --enable-libvidstab --enable-libvorbis --enable-libvpx --enable-libwavpack --enable-libwebp --enable-libx265 --enable-libxml2 --enable-libxvid --enable-libzmq --enable-libzvbi --enable-lv2 --enable-omx --enable-openal --enable-opencl --enable-opengl --enable-sdl2 --enable-pocketsphinx --enable-libmfx --enable-libdc1394 --enable-libdrm --enable-libiec61883 --enable-chromaprint --enable-frei0r --enable-libx264 --enable-shared libavutil 56. 51.100 / 56. 51.100 libavcodec 58. 91.100 / 58. 91.100 libavformat 58. 45.100 / 58. 45.100 libavdevice 58. 10.100 / 58. 10.100 libavfilter 7. 85.100 / 7. 85.100 libavresample 4. 0. 0 / 4. 0. 0 libswscale 5. 7.100 / 5. 7.100 libswresample 3. 7.100 / 3. 7.100 libpostproc 55. 7.100 / 55. 7.100 Input #0, mov,mp4,m4a,3gp,3g2,mj2, from '/tmp/tmp97oitmz2.mp4': Metadata: major_brand : isom minor_version : 512 compatible_brands: isomiso2avc1mp41 encoder : Lavf58.45.100 Duration: 00:00:13.32, start: 0.000000, bitrate: 96 kb/s Stream #0:0(und): Video: h264 (High) (avc1 / 0x31637661), yuv420p, 1000x200, 28 kb/s, 25 fps, 25 tbr, 12800 tbn, 50 tbc (default) Metadata: handler_name : VideoHandler Stream #0:1(und): Audio: aac (LC) (mp4a / 0x6134706D), 24000 Hz, mono, fltp, 62 kb/s (default) Metadata: handler_name : SoundHandler Stream mapping: Stream #0:0 -> #0:0 (h264 (native) -> h264 (libx264)) Stream #0:1 -> #0:1 (aac (native) -> aac (native)) Press [q] to stop, [?] for help [libx264 @ 0x55e07517fa40] using cpu capabilities: MMX2 SSE2Fast SSSE3 SSE4.2 AVX FMA3 BMI2 AVX2 [libx264 @ 0x55e07517fa40] profile High, level 3.1, 4:2:0, 8-bit [libx264 @ 0x55e07517fa40] 264 - core 160 r3011 cde9a93 - H.264/MPEG-4 AVC codec - Copyleft 2003-2020 - http://www.videolan.org/x264.html - options: cabac=1 ref=3 deblock=1:0:0 analyse=0x3:0x113 me=hex subme=7 psy=1 psy_rd=1.00:0.00 mixed_ref=1 me_range=16 chroma_me=1 trellis=1 8x8dct=1 cqm=0 deadzone=21,11 fast_pskip=1 chroma_qp_offset=-2 threads=12 lookahead_threads=2 sliced_threads=0 nr=0 decimate=1 interlaced=0 bluray_compat=0 constrained_intra=0 bframes=3 b_pyramid=2 b_adapt=1 b_bias=0 direct=1 weightb=1 open_gop=0 weightp=2 keyint=250 keyint_min=25 scenecut=40 intra_refresh=0 rc_lookahead=40 rc=crf mbtree=1 crf=23.0 qcomp=0.60 qpmin=0 qpmax=69 qpstep=4 ip_ratio=1.40 aq=1:1.00 Output #0, mp4, to '/tmp/padded_waveform.mp4': Metadata: major_brand : isom minor_version : 512 compatible_brands: isomiso2avc1mp41 encoder : Lavf58.45.100 Stream #0:0(und): Video: h264 (libx264) (avc1 / 0x31637661), yuv420p, 1000x666, q=-1--1, 25 fps, 12800 tbn, 25 tbc (default) Metadata: handler_name : VideoHandler encoder : Lavc58.91.100 libx264 Side data: cpb: bitrate max/min/avg: 0/0/0 buffer size: 0 vbv_delay: N/A Stream #0:1(und): Audio: aac (LC) (mp4a / 0x6134706D), 24000 Hz, mono, fltp, 69 kb/s (default) Metadata: handler_name : SoundHandler encoder : Lavc58.91.100 aac frame= 103 fps=0.0 q=28.0 size= 0kB time=00:00:04.39 bitrate= 0.1kbits/s speed=8.79x frame= 268 fps=267 q=28.0 size= 0kB time=00:00:10.96 bitrate= 0.0kbits/s speed=10.9x frame= 332 fps=261 q=-1.0 Lsize= 157kB time=00:00:13.31 bitrate= 96.6kbits/s speed=10.5x video:48kB audio:100kB subtitle:0kB other streams:0kB global headers:0kB muxing overhead: 6.233152% [libx264 @ 0x55e07517fa40] frame I:2 Avg QP:10.29 size: 6298 [libx264 @ 0x55e07517fa40] frame P:123 Avg QP:13.06 size: 160 [libx264 @ 0x55e07517fa40] frame B:207 Avg QP:14.84 size: 77 [libx264 @ 0x55e07517fa40] consecutive B-frames: 9.9% 18.1% 8.1% 63.9% [libx264 @ 0x55e07517fa40] mb I I16..4: 93.0% 0.3% 6.7% [libx264 @ 0x55e07517fa40] mb P I16..4: 0.4% 0.0% 0.0% P16..4: 0.2% 0.1% 0.0% 0.0% 0.0% skip:99.2% [libx264 @ 0x55e07517fa40] mb B I16..4: 0.2% 0.0% 0.0% B16..8: 0.5% 0.1% 0.0% direct: 0.0% skip:99.2% L0:44.7% L1:54.6% BI: 0.6% [libx264 @ 0x55e07517fa40] 8x8 transform intra:1.9% inter:42.1% [libx264 @ 0x55e07517fa40] coded y,uvDC,uvAC intra: 3.7% 10.9% 8.8% inter: 0.0% 0.1% 0.1% [libx264 @ 0x55e07517fa40] i16 v,h,dc,p: 97% 1% 2% 0% [libx264 @ 0x55e07517fa40] i8 v,h,dc,ddl,ddr,vr,hd,vl,hu: 12% 7% 79% 1% 0% 0% 0% 0% 0% [libx264 @ 0x55e07517fa40] i4 v,h,dc,ddl,ddr,vr,hd,vl,hu: 74% 7% 14% 0% 1% 1% 1% 1% 0% [libx264 @ 0x55e07517fa40] i8c dc,h,v,p: 63% 3% 34% 0% [libx264 @ 0x55e07517fa40] Weighted P-Frames: Y:0.0% UV:0.0% [libx264 @ 0x55e07517fa40] ref P L0: 71.8% 2.2% 20.6% 5.4% [libx264 @ 0x55e07517fa40] ref B L0: 86.6% 12.1% 1.3% [libx264 @ 0x55e07517fa40] ref B L1: 99.0% 1.0% [libx264 @ 0x55e07517fa40] kb/s:29.06 [aac @ 0x55e0751810c0] Qavg: 21293.207 ffmpeg version 4.3.6-0+deb11u1 Copyright (c) 2000-2023 the FFmpeg developers built with gcc 10 (Debian 10.2.1-6) configuration: --prefix=/usr --extra-version=0+deb11u1 --toolchain=hardened --libdir=/usr/lib/x86_64-linux-gnu --incdir=/usr/include/x86_64-linux-gnu --arch=amd64 --enable-gpl --disable-stripping --enable-avresample --disable-filter=resample --enable-gnutls --enable-ladspa --enable-libaom --enable-libass --enable-libbluray --enable-libbs2b --enable-libcaca --enable-libcdio --enable-libcodec2 --enable-libdav1d --enable-libflite --enable-libfontconfig --enable-libfreetype --enable-libfribidi --enable-libgme --enable-libgsm --enable-libjack --enable-libmp3lame --enable-libmysofa --enable-libopenjpeg --enable-libopenmpt --enable-libopus --enable-libpulse --enable-librabbitmq --enable-librsvg --enable-librubberband --enable-libshine --enable-libsnappy --enable-libsoxr --enable-libspeex --enable-libsrt --enable-libssh --enable-libtheora --enable-libtwolame --enable-libvidstab --enable-libvorbis --enable-libvpx --enable-libwavpack --enable-libwebp --enable-libx265 --enable-libxml2 --enable-libxvid --enable-libzmq --enable-libzvbi --enable-lv2 --enable-omx --enable-openal --enable-opencl --enable-opengl --enable-sdl2 --enable-pocketsphinx --enable-libmfx --enable-libdc1394 --enable-libdrm --enable-libiec61883 --enable-chromaprint --enable-frei0r --enable-libx264 --enable-shared libavutil 56. 51.100 / 56. 51.100 libavcodec 58. 91.100 / 58. 91.100 libavformat 58. 45.100 / 58. 45.100 libavdevice 58. 10.100 / 58. 10.100 libavfilter 7. 85.100 / 7. 85.100 libavresample 4. 0. 0 / 4. 0. 0 libswscale 5. 7.100 / 5. 7.100 libswresample 3. 7.100 / 3. 7.100 libpostproc 55. 7.100 / 55. 7.100 Input #0, mov,mp4,m4a,3gp,3g2,mj2, from '/tmp/padded_waveform.mp4': Metadata: major_brand : isom minor_version : 512 compatible_brands: isomiso2avc1mp41 encoder : Lavf58.45.100 Duration: 00:00:13.36, start: 0.000000, bitrate: 96 kb/s Stream #0:0(und): Video: h264 (High) (avc1 / 0x31637661), yuv420p, 1000x666, 29 kb/s, 25 fps, 25 tbr, 12800 tbn, 50 tbc (default) Metadata: handler_name : VideoHandler Stream #0:1(und): Audio: aac (LC) (mp4a / 0x6134706D), 24000 Hz, mono, fltp, 61 kb/s (default) Metadata: handler_name : SoundHandler Input #1, png_pipe, from '/tmp/background.png': Duration: N/A, bitrate: N/A Stream #1:0: Video: png, rgb48be(pc), 980x447, 25 tbr, 25 tbn, 25 tbc Stream mapping: Stream #0:0 (h264) -> overlay:main (graph 0) Stream #1:0 (png) -> overlay:overlay (graph 0) overlay (graph 0) -> Stream #0:0 (libx264) Stream #0:1 -> #0:1 (aac (native) -> aac (native)) Press [q] to stop, [?] for help [libx264 @ 0x55bf473c4880] using cpu capabilities: MMX2 SSE2Fast SSSE3 SSE4.2 AVX FMA3 BMI2 AVX2 [libx264 @ 0x55bf473c4880] profile High, level 3.1, 4:2:0, 8-bit [libx264 @ 0x55bf473c4880] 264 - core 160 r3011 cde9a93 - H.264/MPEG-4 AVC codec - Copyleft 2003-2020 - http://www.videolan.org/x264.html - options: cabac=1 ref=3 deblock=1:0:0 analyse=0x3:0x113 me=hex subme=7 psy=1 psy_rd=1.00:0.00 mixed_ref=1 me_range=16 chroma_me=1 trellis=1 8x8dct=1 cqm=0 deadzone=21,11 fast_pskip=1 chroma_qp_offset=-2 threads=12 lookahead_threads=2 sliced_threads=0 nr=0 decimate=1 interlaced=0 bluray_compat=0 constrained_intra=0 bframes=3 b_pyramid=2 b_adapt=1 b_bias=0 direct=1 weightb=1 open_gop=0 weightp=2 keyint=250 keyint_min=25 scenecut=40 intra_refresh=0 rc_lookahead=40 rc=crf mbtree=1 crf=23.0 qcomp=0.60 qpmin=0 qpmax=69 qpstep=4 ip_ratio=1.40 aq=1:1.00 Output #0, mp4, to '/tmp/final_waveform.mp4': Metadata: major_brand : isom minor_version : 512 compatible_brands: isomiso2avc1mp41 encoder : Lavf58.45.100 Stream #0:0: Video: h264 (libx264) (avc1 / 0x31637661), yuv420p, 1000x666, q=-1--1, 25 fps, 12800 tbn, 25 tbc (default) Metadata: encoder : Lavc58.91.100 libx264 Side data: cpb: bitrate max/min/avg: 0/0/0 buffer size: 0 vbv_delay: N/A Stream #0:1(und): Audio: aac (LC) (mp4a / 0x6134706D), 24000 Hz, mono, fltp, 69 kb/s (default) Metadata: handler_name : SoundHandler encoder : Lavc58.91.100 aac frame= 94 fps=0.0 q=28.0 size= 0kB time=00:00:04.01 bitrate= 0.1kbits/s speed=7.94x frame= 215 fps=214 q=28.0 size= 0kB time=00:00:08.83 bitrate= 0.0kbits/s speed=8.78x frame= 326 fps=216 q=28.0 size= 0kB time=00:00:13.22 bitrate= 0.0kbits/s speed=8.78x frame= 332 fps=198 q=-1.0 Lsize= 237kB time=00:00:13.31 bitrate= 146.0kbits/s speed=7.96x video:127kB audio:101kB subtitle:0kB other streams:0kB global headers:0kB muxing overhead: 4.019545% [libx264 @ 0x55bf473c4880] frame I:2 Avg QP:16.36 size: 45684 [libx264 @ 0x55bf473c4880] frame P:124 Avg QP:18.79 size: 178 [libx264 @ 0x55bf473c4880] frame B:206 Avg QP:20.74 size: 76 [libx264 @ 0x55bf473c4880] consecutive B-frames: 10.8% 15.7% 10.8% 62.7% [libx264 @ 0x55bf473c4880] mb I I16..4: 62.6% 9.8% 27.7% [libx264 @ 0x55bf473c4880] mb P I16..4: 0.3% 0.1% 0.0% P16..4: 0.4% 0.1% 0.0% 0.0% 0.0% skip:99.0% [libx264 @ 0x55bf473c4880] mb B I16..4: 0.1% 0.0% 0.0% B16..8: 0.6% 0.1% 0.0% direct: 0.0% skip:99.2% L0:40.0% L1:59.4% BI: 0.6% [libx264 @ 0x55bf473c4880] 8x8 transform intra:13.4% inter:38.1% [libx264 @ 0x55bf473c4880] coded y,uvDC,uvAC intra: 13.6% 27.5% 24.9% inter: 0.0% 0.1% 0.1% [libx264 @ 0x55bf473c4880] i16 v,h,dc,p: 91% 6% 3% 0% [libx264 @ 0x55bf473c4880] i8 v,h,dc,ddl,ddr,vr,hd,vl,hu: 27% 8% 65% 0% 0% 0% 0% 0% 0% [libx264 @ 0x55bf473c4880] i4 v,h,dc,ddl,ddr,vr,hd,vl,hu: 57% 14% 12% 2% 3% 4% 4% 3% 3% [libx264 @ 0x55bf473c4880] i8c dc,h,v,p: 51% 10% 37% 2% [libx264 @ 0x55bf473c4880] Weighted P-Frames: Y:0.0% UV:0.0% [libx264 @ 0x55bf473c4880] ref P L0: 81.9% 1.8% 13.6% 2.7% [libx264 @ 0x55bf473c4880] ref B L0: 87.4% 10.6% 1.9% [libx264 @ 0x55bf473c4880] ref B L1: 98.5% 1.5% [libx264 @ 0x55bf473c4880] kb/s:77.69 [aac @ 0x55bf473c5e80] Qavg: 21020.768
Prediction
fofr/audio-to-waveform:89d3082a1055872f519b7f5b2b3777dfb8203cc8005da0f6cd8535d635022f54ID2n2cgzrbxnqeshbc56apt2hq7mStatusSucceededSourceWebHardwareCPUTotal durationCreatedInput
- audio
- Video Player is loading.Current Time 00:00:000/Duration 00:00:000Loaded: 0%Stream Type LIVERemaining Time -00:00:0001x
- Chapters
- descriptions off, selected
- captions settings, opens captions settings dialog
- captions off, selected
This is a modal window.
Beginning of dialog window. Escape will cancel and close the window.
End of dialog window.
- bg_color
- #5d9e10
- fg_alpha
- 0.75
- bar_count
- 100
- bar_width
- 0.4
- bars_color
- #ffffff
- caption_text
- 20s power metal, infectious hand percussion rhythms, polished vocal production, cajon brush, egyptian scale mode, live stream recording
{ "audio": "https://replicate.delivery/pbxt/J04t1wubCrRB2KTP36LTaaRFHvSNrMxkOexsUhowrC3GlhGu/20230613T093215883Z_20s_power_metal%2C_infectious_ha.wav", "bg_color": "#5d9e10", "fg_alpha": 0.75, "bar_count": 100, "bar_width": 0.4, "bars_color": "#ffffff", "caption_text": "20s power metal, infectious hand percussion rhythms, polished vocal production, cajon brush, egyptian scale mode, live stream recording" }
Install Replicate’s Node.js client library:npm install replicate
Import and set up the client:import Replicate from "replicate"; import fs from "node:fs"; const replicate = new Replicate({ auth: process.env.REPLICATE_API_TOKEN, });
Run fofr/audio-to-waveform using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
const output = await replicate.run( "fofr/audio-to-waveform:89d3082a1055872f519b7f5b2b3777dfb8203cc8005da0f6cd8535d635022f54", { input: { audio: "https://replicate.delivery/pbxt/J04t1wubCrRB2KTP36LTaaRFHvSNrMxkOexsUhowrC3GlhGu/20230613T093215883Z_20s_power_metal%2C_infectious_ha.wav", bg_color: "#5d9e10", fg_alpha: 0.75, bar_count: 100, bar_width: 0.4, bars_color: "#ffffff", caption_text: "20s power metal, infectious hand percussion rhythms, polished vocal production, cajon brush, egyptian scale mode, live stream recording" } } ); // To access the file URL: console.log(output.url()); //=> "http://example.com" // To write the file to disk: fs.writeFile("my-image.png", output);
To learn more, take a look at the guide on getting started with Node.js.
Install Replicate’s Python client library:pip install replicate
Import the client:import replicate
Run fofr/audio-to-waveform using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
output = replicate.run( "fofr/audio-to-waveform:89d3082a1055872f519b7f5b2b3777dfb8203cc8005da0f6cd8535d635022f54", input={ "audio": "https://replicate.delivery/pbxt/J04t1wubCrRB2KTP36LTaaRFHvSNrMxkOexsUhowrC3GlhGu/20230613T093215883Z_20s_power_metal%2C_infectious_ha.wav", "bg_color": "#5d9e10", "fg_alpha": 0.75, "bar_count": 100, "bar_width": 0.4, "bars_color": "#ffffff", "caption_text": "20s power metal, infectious hand percussion rhythms, polished vocal production, cajon brush, egyptian scale mode, live stream recording" } ) # To access the file URL: print(output.url()) #=> "http://example.com" # To write the file to disk: with open("my-image.png", "wb") as file: file.write(output.read())
To learn more, take a look at the guide on getting started with Python.
Run fofr/audio-to-waveform using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
curl -s -X POST \ -H "Authorization: Bearer $REPLICATE_API_TOKEN" \ -H "Content-Type: application/json" \ -H "Prefer: wait" \ -d $'{ "version": "fofr/audio-to-waveform:89d3082a1055872f519b7f5b2b3777dfb8203cc8005da0f6cd8535d635022f54", "input": { "audio": "https://replicate.delivery/pbxt/J04t1wubCrRB2KTP36LTaaRFHvSNrMxkOexsUhowrC3GlhGu/20230613T093215883Z_20s_power_metal%2C_infectious_ha.wav", "bg_color": "#5d9e10", "fg_alpha": 0.75, "bar_count": 100, "bar_width": 0.4, "bars_color": "#ffffff", "caption_text": "20s power metal, infectious hand percussion rhythms, polished vocal production, cajon brush, egyptian scale mode, live stream recording" } }' \ https://api.replicate.com/v1/predictions
To learn more, take a look at Replicate’s HTTP API reference docs.
Output
{ "completed_at": "2023-06-14T15:27:23.768259Z", "created_at": "2023-06-14T15:27:20.576034Z", "data_removed": false, "error": null, "id": "2n2cgzrbxnqeshbc56apt2hq7m", "input": { "audio": "https://replicate.delivery/pbxt/J04t1wubCrRB2KTP36LTaaRFHvSNrMxkOexsUhowrC3GlhGu/20230613T093215883Z_20s_power_metal%2C_infectious_ha.wav", "bg_color": "#5d9e10", "fg_alpha": 0.75, "bar_count": 100, "bar_width": 0.4, "bars_color": "#ffffff", "caption_text": "20s power metal, infectious hand percussion rhythms, polished vocal production, cajon brush, egyptian scale mode, live stream recording" }, "logs": "/usr/local/lib/python3.9/site-packages/matplotlib/axes/_axes.py:2229: RuntimeWarning: overflow encountered in scalar add\ndx = [convert(x0 + ddx) - x for ddx in dx]\n/usr/local/lib/python3.9/site-packages/matplotlib/axes/_axes.py:2229: RuntimeWarning: overflow encountered in scalar subtract\ndx = [convert(x0 + ddx) - x for ddx in dx]\n/usr/local/lib/python3.9/site-packages/matplotlib/patches.py:739: RuntimeWarning: overflow encountered in scalar add\ny1 = self.convert_yunits(self._y0 + self._height)\nffmpeg version 4.3.6-0+deb11u1 Copyright (c) 2000-2023 the FFmpeg developers\nbuilt with gcc 10 (Debian 10.2.1-6)\nconfiguration: --prefix=/usr --extra-version=0+deb11u1 --toolchain=hardened --libdir=/usr/lib/x86_64-linux-gnu --incdir=/usr/include/x86_64-linux-gnu --arch=amd64 --enable-gpl --disable-stripping --enable-avresample --disable-filter=resample --enable-gnutls --enable-ladspa --enable-libaom --enable-libass --enable-libbluray --enable-libbs2b --enable-libcaca --enable-libcdio --enable-libcodec2 --enable-libdav1d --enable-libflite --enable-libfontconfig --enable-libfreetype --enable-libfribidi --enable-libgme --enable-libgsm --enable-libjack --enable-libmp3lame --enable-libmysofa --enable-libopenjpeg --enable-libopenmpt --enable-libopus --enable-libpulse --enable-librabbitmq --enable-librsvg --enable-librubberband --enable-libshine --enable-libsnappy --enable-libsoxr --enable-libspeex --enable-libsrt --enable-libssh --enable-libtheora --enable-libtwolame --enable-libvidstab --enable-libvorbis --enable-libvpx --enable-libwavpack --enable-libwebp --enable-libx265 --enable-libxml2 --enable-libxvid --enable-libzmq --enable-libzvbi --enable-lv2 --enable-omx --enable-openal --enable-opencl --enable-opengl --enable-sdl2 --enable-pocketsphinx --enable-libmfx --enable-libdc1394 --enable-libdrm --enable-libiec61883 --enable-chromaprint --enable-frei0r --enable-libx264 --enable-shared\nlibavutil 56. 51.100 / 56. 51.100\nlibavcodec 58. 91.100 / 58. 91.100\nlibavformat 58. 45.100 / 58. 45.100\nlibavdevice 58. 10.100 / 58. 10.100\nlibavfilter 7. 85.100 / 7. 85.100\nlibavresample 4. 0. 0 / 4. 0. 0\nlibswscale 5. 7.100 / 5. 7.100\nlibswresample 3. 7.100 / 3. 7.100\nlibpostproc 55. 7.100 / 55. 7.100\nInput #0, png_pipe, from '/tmp/tmpfig9gymu.png':\nDuration: N/A, bitrate: N/A\nStream #0:0: Video: png, rgba(pc), 1000x200, 25 fps, 25 tbr, 25 tbn, 25 tbc\nGuessed Channel Layout for Input Stream #1.0 : mono\nInput #1, wav, from '/tmp/tmpt80hf47t20230613T093215883Z_20s_power_metal,_infectious_ha.wav':\nDuration: 00:00:08.00, bitrate: 512 kb/s\nStream #1:0: Audio: pcm_s16le ([1][0][0][0] / 0x0001), 32000 Hz, mono, s16, 512 kb/s\nStream mapping:\nStream #0:0 -> #0:0 (png (native) -> h264 (libx264))\nStream #1:0 -> #0:1 (pcm_s16le (native) -> aac (native))\nPress [q] to stop, [?] for help\n[libx264 @ 0x559b6a8cd700] using cpu capabilities: MMX2 SSE2Fast SSSE3 SSE4.2 AVX FMA3 BMI2 AVX2\n[libx264 @ 0x559b6a8cd700] profile High, level 3.0, 4:2:0, 8-bit\n[libx264 @ 0x559b6a8cd700] 264 - core 160 r3011 cde9a93 - H.264/MPEG-4 AVC codec - Copyleft 2003-2020 - http://www.videolan.org/x264.html - options: cabac=1 ref=3 deblock=1:0:0 analyse=0x3:0x113 me=hex subme=7 psy=1 psy_rd=1.00:0.00 mixed_ref=1 me_range=16 chroma_me=1 trellis=1 8x8dct=1 cqm=0 deadzone=21,11 fast_pskip=1 chroma_qp_offset=-2 threads=6 lookahead_threads=1 sliced_threads=0 nr=0 decimate=1 interlaced=0 bluray_compat=0 constrained_intra=0 bframes=3 b_pyramid=2 b_adapt=1 b_bias=0 direct=1 weightb=1 open_gop=0 weightp=2 keyint=250 keyint_min=25 scenecut=40 intra_refresh=0 rc_lookahead=40 rc=crf mbtree=1 crf=23.0 qcomp=0.60 qpmin=0 qpmax=69 qpstep=4 ip_ratio=1.40 aq=1:1.00\nOutput #0, mp4, to '/tmp/tmp_v8et_m6.mp4':\nMetadata:\nencoder : Lavf58.45.100\nStream #0:0: Video: h264 (libx264) (avc1 / 0x31637661), yuv420p(progressive), 1000x200, q=-1--1, 25 fps, 12800 tbn, 25 tbc\nMetadata:\nencoder : Lavc58.91.100 libx264\nSide data:\ncpb: bitrate max/min/avg: 0/0/0 buffer size: 0 vbv_delay: N/A\nStream #0:1: Audio: aac (LC) (mp4a / 0x6134706D), 32000 Hz, mono, fltp, 69 kb/s\nMetadata:\nencoder : Lavc58.91.100 aac\nframe= 200 fps=0.0 q=-1.0 Lsize= 109kB time=00:00:08.00 bitrate= 111.6kbits/s speed=15.1x\nvideo:34kB audio:68kB subtitle:0kB other streams:0kB global headers:0kB muxing overhead: 6.681644%\n[libx264 @ 0x559b6a8cd700] frame I:1 Avg QP:20.28 size: 7953\n[libx264 @ 0x559b6a8cd700] frame P:145 Avg QP:24.36 size: 156\n[libx264 @ 0x559b6a8cd700] frame B:54 Avg QP:19.79 size: 68\n[libx264 @ 0x559b6a8cd700] consecutive B-frames: 51.0% 36.0% 9.0% 4.0%\n[libx264 @ 0x559b6a8cd700] mb I I16..4: 60.0% 10.5% 29.5%\n[libx264 @ 0x559b6a8cd700] mb P I16..4: 1.2% 0.1% 0.2% P16..4: 0.9% 0.4% 0.1% 0.0% 0.0% skip:97.2%\n[libx264 @ 0x559b6a8cd700] mb B I16..4: 0.4% 0.1% 0.0% B16..8: 2.6% 0.2% 0.0% direct: 0.0% skip:96.6% L0:44.4% L1:55.1% BI: 0.5%\n[libx264 @ 0x559b6a8cd700] 8x8 transform intra:8.9% inter:42.6%\n[libx264 @ 0x559b6a8cd700] coded y,uvDC,uvAC intra: 7.9% 22.7% 18.8% inter: 0.1% 0.3% 0.2%\n[libx264 @ 0x559b6a8cd700] i16 v,h,dc,p: 92% 3% 6% 0%\n[libx264 @ 0x559b6a8cd700] i8 v,h,dc,ddl,ddr,vr,hd,vl,hu: 19% 7% 72% 0% 0% 0% 0% 0% 0%\n[libx264 @ 0x559b6a8cd700] i4 v,h,dc,ddl,ddr,vr,hd,vl,hu: 75% 6% 14% 0% 1% 2% 1% 1% 1%\n[libx264 @ 0x559b6a8cd700] i8c dc,h,v,p: 18% 4% 78% 0%\n[libx264 @ 0x559b6a8cd700] Weighted P-Frames: Y:0.0% UV:0.0%\n[libx264 @ 0x559b6a8cd700] ref P L0: 75.5% 3.1% 15.8% 5.6%\n[libx264 @ 0x559b6a8cd700] ref B L0: 87.5% 12.3% 0.2%\n[libx264 @ 0x559b6a8cd700] ref B L1: 99.9% 0.1%\n[libx264 @ 0x559b6a8cd700] kb/s:34.28\n[aac @ 0x559b6a8cecc0] Qavg: 130.508\nffmpeg version 4.3.6-0+deb11u1 Copyright (c) 2000-2023 the FFmpeg developers\nbuilt with gcc 10 (Debian 10.2.1-6)\nconfiguration: --prefix=/usr --extra-version=0+deb11u1 --toolchain=hardened --libdir=/usr/lib/x86_64-linux-gnu --incdir=/usr/include/x86_64-linux-gnu --arch=amd64 --enable-gpl --disable-stripping --enable-avresample --disable-filter=resample --enable-gnutls --enable-ladspa --enable-libaom --enable-libass --enable-libbluray --enable-libbs2b --enable-libcaca --enable-libcdio --enable-libcodec2 --enable-libdav1d --enable-libflite --enable-libfontconfig --enable-libfreetype --enable-libfribidi --enable-libgme --enable-libgsm --enable-libjack --enable-libmp3lame --enable-libmysofa --enable-libopenjpeg --enable-libopenmpt --enable-libopus --enable-libpulse --enable-librabbitmq --enable-librsvg --enable-librubberband --enable-libshine --enable-libsnappy --enable-libsoxr --enable-libspeex --enable-libsrt --enable-libssh --enable-libtheora --enable-libtwolame --enable-libvidstab --enable-libvorbis --enable-libvpx --enable-libwavpack --enable-libwebp --enable-libx265 --enable-libxml2 --enable-libxvid --enable-libzmq --enable-libzvbi --enable-lv2 --enable-omx --enable-openal --enable-opencl --enable-opengl --enable-sdl2 --enable-pocketsphinx --enable-libmfx --enable-libdc1394 --enable-libdrm --enable-libiec61883 --enable-chromaprint --enable-frei0r --enable-libx264 --enable-shared\nlibavutil 56. 51.100 / 56. 51.100\nlibavcodec 58. 91.100 / 58. 91.100\nlibavformat 58. 45.100 / 58. 45.100\nlibavdevice 58. 10.100 / 58. 10.100\nlibavfilter 7. 85.100 / 7. 85.100\nlibavresample 4. 0. 0 / 4. 0. 0\nlibswscale 5. 7.100 / 5. 7.100\nlibswresample 3. 7.100 / 3. 7.100\nlibpostproc 55. 7.100 / 55. 7.100\nInput #0, mov,mp4,m4a,3gp,3g2,mj2, from '/tmp/tmp_v8et_m6.mp4':\nMetadata:\nmajor_brand : isom\nminor_version : 512\ncompatible_brands: isomiso2avc1mp41\nencoder : Lavf58.45.100\nDuration: 00:00:08.03, start: 0.000000, bitrate: 111 kb/s\nStream #0:0(und): Video: h264 (High) (avc1 / 0x31637661), yuv420p, 1000x200, 34 kb/s, 25 fps, 25 tbr, 12800 tbn, 50 tbc (default)\nMetadata:\nhandler_name : VideoHandler\nStream #0:1(und): Audio: aac (LC) (mp4a / 0x6134706D), 32000 Hz, mono, fltp, 69 kb/s (default)\nMetadata:\nhandler_name : SoundHandler\nStream mapping:\nStream #0:0 -> #0:0 (h264 (native) -> h264 (libx264))\nStream #0:1 -> #0:1 (aac (native) -> aac (native))\nPress [q] to stop, [?] for help\n[libx264 @ 0x5563d8e03f00] using cpu capabilities: MMX2 SSE2Fast SSSE3 SSE4.2 AVX FMA3 BMI2 AVX2\n[libx264 @ 0x5563d8e03f00] profile High, level 3.1, 4:2:0, 8-bit\n[libx264 @ 0x5563d8e03f00] 264 - core 160 r3011 cde9a93 - H.264/MPEG-4 AVC codec - Copyleft 2003-2020 - http://www.videolan.org/x264.html - options: cabac=1 ref=3 deblock=1:0:0 analyse=0x3:0x113 me=hex subme=7 psy=1 psy_rd=1.00:0.00 mixed_ref=1 me_range=16 chroma_me=1 trellis=1 8x8dct=1 cqm=0 deadzone=21,11 fast_pskip=1 chroma_qp_offset=-2 threads=12 lookahead_threads=2 sliced_threads=0 nr=0 decimate=1 interlaced=0 bluray_compat=0 constrained_intra=0 bframes=3 b_pyramid=2 b_adapt=1 b_bias=0 direct=1 weightb=1 open_gop=0 weightp=2 keyint=250 keyint_min=25 scenecut=40 intra_refresh=0 rc_lookahead=40 rc=crf mbtree=1 crf=23.0 qcomp=0.60 qpmin=0 qpmax=69 qpstep=4 ip_ratio=1.40 aq=1:1.00\nOutput #0, mp4, to '/tmp/padded_waveform.mp4':\nMetadata:\nmajor_brand : isom\nminor_version : 512\ncompatible_brands: isomiso2avc1mp41\nencoder : Lavf58.45.100\nStream #0:0(und): Video: h264 (libx264) (avc1 / 0x31637661), yuv420p, 1000x666, q=-1--1, 25 fps, 12800 tbn, 25 tbc (default)\nMetadata:\nhandler_name : VideoHandler\nencoder : Lavc58.91.100 libx264\nSide data:\ncpb: bitrate max/min/avg: 0/0/0 buffer size: 0 vbv_delay: N/A\nStream #0:1(und): Audio: aac (LC) (mp4a / 0x6134706D), 32000 Hz, mono, fltp, 69 kb/s (default)\nMetadata:\nhandler_name : SoundHandler\nencoder : Lavc58.91.100 aac\nframe= 178 fps=0.0 q=28.0 size= 0kB time=00:00:07.39 bitrate= 0.1kbits/s speed=14.6x\nframe= 200 fps=0.0 q=-1.0 Lsize= 113kB time=00:00:08.00 bitrate= 115.3kbits/s speed=12.7x\nvideo:37kB audio:68kB subtitle:0kB other streams:0kB global headers:0kB muxing overhead: 6.771026%\n[libx264 @ 0x5563d8e03f00] frame I:1 Avg QP:13.05 size: 8743\n[libx264 @ 0x5563d8e03f00] frame P:104 Avg QP:14.62 size: 200\n[libx264 @ 0x5563d8e03f00] frame B:95 Avg QP:15.87 size: 84\n[libx264 @ 0x5563d8e03f00] consecutive B-frames: 25.5% 28.0% 16.5% 30.0%\n[libx264 @ 0x5563d8e03f00] mb I I16..4: 87.3% 2.5% 10.2%\n[libx264 @ 0x5563d8e03f00] mb P I16..4: 0.4% 0.0% 0.1% P16..4: 0.2% 0.1% 0.0% 0.0% 0.0% skip:99.1%\n[libx264 @ 0x5563d8e03f00] mb B I16..4: 0.2% 0.0% 0.0% B16..8: 0.7% 0.1% 0.0% direct: 0.0% skip:99.0% L0:42.2% L1:57.0% BI: 0.9%\n[libx264 @ 0x5563d8e03f00] 8x8 transform intra:4.1% inter:49.2%\n[libx264 @ 0x5563d8e03f00] coded y,uvDC,uvAC intra: 6.4% 16.9% 13.6% inter: 0.0% 0.1% 0.1%\n[libx264 @ 0x5563d8e03f00] i16 v,h,dc,p: 97% 1% 2% 0%\n[libx264 @ 0x5563d8e03f00] i8 v,h,dc,ddl,ddr,vr,hd,vl,hu: 30% 7% 60% 2% 0% 0% 0% 0% 1%\n[libx264 @ 0x5563d8e03f00] i4 v,h,dc,ddl,ddr,vr,hd,vl,hu: 76% 5% 14% 0% 1% 1% 1% 1% 0%\n[libx264 @ 0x5563d8e03f00] i8c dc,h,v,p: 49% 2% 48% 0%\n[libx264 @ 0x5563d8e03f00] Weighted P-Frames: Y:0.0% UV:0.0%\n[libx264 @ 0x5563d8e03f00] ref P L0: 70.5% 3.8% 16.1% 9.6%\n[libx264 @ 0x5563d8e03f00] ref B L0: 86.6% 11.5% 1.8%\n[libx264 @ 0x5563d8e03f00] ref B L1: 99.0% 1.0%\n[libx264 @ 0x5563d8e03f00] kb/s:37.52\n[aac @ 0x5563d8e03840] Qavg: 130.633\nffmpeg version 4.3.6-0+deb11u1 Copyright (c) 2000-2023 the FFmpeg developers\nbuilt with gcc 10 (Debian 10.2.1-6)\nconfiguration: --prefix=/usr --extra-version=0+deb11u1 --toolchain=hardened --libdir=/usr/lib/x86_64-linux-gnu --incdir=/usr/include/x86_64-linux-gnu --arch=amd64 --enable-gpl --disable-stripping --enable-avresample --disable-filter=resample --enable-gnutls --enable-ladspa --enable-libaom --enable-libass --enable-libbluray --enable-libbs2b --enable-libcaca --enable-libcdio --enable-libcodec2 --enable-libdav1d --enable-libflite --enable-libfontconfig --enable-libfreetype --enable-libfribidi --enable-libgme --enable-libgsm --enable-libjack --enable-libmp3lame --enable-libmysofa --enable-libopenjpeg --enable-libopenmpt --enable-libopus --enable-libpulse --enable-librabbitmq --enable-librsvg --enable-librubberband --enable-libshine --enable-libsnappy --enable-libsoxr --enable-libspeex --enable-libsrt --enable-libssh --enable-libtheora --enable-libtwolame --enable-libvidstab --enable-libvorbis --enable-libvpx --enable-libwavpack --enable-libwebp --enable-libx265 --enable-libxml2 --enable-libxvid --enable-libzmq --enable-libzvbi --enable-lv2 --enable-omx --enable-openal --enable-opencl --enable-opengl --enable-sdl2 --enable-pocketsphinx --enable-libmfx --enable-libdc1394 --enable-libdrm --enable-libiec61883 --enable-chromaprint --enable-frei0r --enable-libx264 --enable-shared\nlibavutil 56. 51.100 / 56. 51.100\nlibavcodec 58. 91.100 / 58. 91.100\nlibavformat 58. 45.100 / 58. 45.100\nlibavdevice 58. 10.100 / 58. 10.100\nlibavfilter 7. 85.100 / 7. 85.100\nlibavresample 4. 0. 0 / 4. 0. 0\nlibswscale 5. 7.100 / 5. 7.100\nlibswresample 3. 7.100 / 3. 7.100\nlibpostproc 55. 7.100 / 55. 7.100\nInput #0, mov,mp4,m4a,3gp,3g2,mj2, from '/tmp/padded_waveform.mp4':\nMetadata:\nmajor_brand : isom\nminor_version : 512\ncompatible_brands: isomiso2avc1mp41\nencoder : Lavf58.45.100\nDuration: 00:00:08.03, start: 0.000000, bitrate: 114 kb/s\nStream #0:0(und): Video: h264 (High) (avc1 / 0x31637661), yuv420p, 1000x666, 38 kb/s, 25 fps, 25 tbr, 12800 tbn, 50 tbc (default)\nMetadata:\nhandler_name : VideoHandler\nStream #0:1(und): Audio: aac (LC) (mp4a / 0x6134706D), 32000 Hz, mono, fltp, 69 kb/s (default)\nMetadata:\nhandler_name : SoundHandler\nInput #1, png_pipe, from '/tmp/background.png':\nDuration: N/A, bitrate: N/A\nStream #1:0: Video: png, rgb48be(pc), 980x447, 25 tbr, 25 tbn, 25 tbc\nStream mapping:\nStream #0:0 (h264) -> overlay:main (graph 0)\nStream #1:0 (png) -> overlay:overlay (graph 0)\noverlay (graph 0) -> Stream #0:0 (libx264)\nStream #0:1 -> #0:1 (aac (native) -> aac (native))\nPress [q] to stop, [?] for help\n[libx264 @ 0x55d22a508480] using cpu capabilities: MMX2 SSE2Fast SSSE3 SSE4.2 AVX FMA3 BMI2 AVX2\n[libx264 @ 0x55d22a508480] profile High, level 3.1, 4:2:0, 8-bit\n[libx264 @ 0x55d22a508480] 264 - core 160 r3011 cde9a93 - H.264/MPEG-4 AVC codec - Copyleft 2003-2020 - http://www.videolan.org/x264.html - options: cabac=1 ref=3 deblock=1:0:0 analyse=0x3:0x113 me=hex subme=7 psy=1 psy_rd=1.00:0.00 mixed_ref=1 me_range=16 chroma_me=1 trellis=1 8x8dct=1 cqm=0 deadzone=21,11 fast_pskip=1 chroma_qp_offset=-2 threads=12 lookahead_threads=2 sliced_threads=0 nr=0 decimate=1 interlaced=0 bluray_compat=0 constrained_intra=0 bframes=3 b_pyramid=2 b_adapt=1 b_bias=0 direct=1 weightb=1 open_gop=0 weightp=2 keyint=250 keyint_min=25 scenecut=40 intra_refresh=0 rc_lookahead=40 rc=crf mbtree=1 crf=23.0 qcomp=0.60 qpmin=0 qpmax=69 qpstep=4 ip_ratio=1.40 aq=1:1.00\nOutput #0, mp4, to '/tmp/final_waveform.mp4':\nMetadata:\nmajor_brand : isom\nminor_version : 512\ncompatible_brands: isomiso2avc1mp41\nencoder : Lavf58.45.100\nStream #0:0: Video: h264 (libx264) (avc1 / 0x31637661), yuv420p, 1000x666, q=-1--1, 25 fps, 12800 tbn, 25 tbc (default)\nMetadata:\nencoder : Lavc58.91.100 libx264\nSide data:\ncpb: bitrate max/min/avg: 0/0/0 buffer size: 0 vbv_delay: N/A\nStream #0:1(und): Audio: aac (LC) (mp4a / 0x6134706D), 32000 Hz, mono, fltp, 69 kb/s (default)\nMetadata:\nhandler_name : SoundHandler\nencoder : Lavc58.91.100 aac\nframe= 133 fps=0.0 q=28.0 size= 0kB time=00:00:05.56 bitrate= 0.1kbits/s speed=11.1x\nframe= 200 fps=0.0 q=-1.0 Lsize= 148kB time=00:00:08.00 bitrate= 151.1kbits/s speed=8.59x\nvideo:72kB audio:69kB subtitle:0kB other streams:0kB global headers:0kB muxing overhead: 5.107691%\n[libx264 @ 0x55d22a508480] frame I:1 Avg QP:18.16 size: 40130\n[libx264 @ 0x55d22a508480] frame P:108 Avg QP:19.99 size: 232\n[libx264 @ 0x55d22a508480] frame B:91 Avg QP:21.80 size: 83\n[libx264 @ 0x55d22a508480] consecutive B-frames: 26.5% 35.0% 10.5% 28.0%\n[libx264 @ 0x55d22a508480] mb I I16..4: 69.5% 1.7% 28.7%\n[libx264 @ 0x55d22a508480] mb P I16..4: 0.5% 0.1% 0.1% P16..4: 1.3% 0.1% 0.0% 0.0% 0.0% skip:97.9%\n[libx264 @ 0x55d22a508480] mb B I16..4: 0.1% 0.0% 0.0% B16..8: 0.9% 0.1% 0.0% direct: 0.0% skip:98.9% L0:44.1% L1:55.2% BI: 0.8%\n[libx264 @ 0x55d22a508480] 8x8 transform intra:6.5% inter:72.3%\n[libx264 @ 0x55d22a508480] coded y,uvDC,uvAC intra: 12.9% 30.7% 23.4% inter: 0.2% 0.6% 0.1%\n[libx264 @ 0x55d22a508480] i16 v,h,dc,p: 94% 4% 2% 0%\n[libx264 @ 0x55d22a508480] i8 v,h,dc,ddl,ddr,vr,hd,vl,hu: 12% 4% 82% 1% 0% 0% 0% 0% 0%\n[libx264 @ 0x55d22a508480] i4 v,h,dc,ddl,ddr,vr,hd,vl,hu: 62% 12% 13% 2% 2% 3% 3% 2% 2%\n[libx264 @ 0x55d22a508480] i8c dc,h,v,p: 40% 9% 50% 1%\n[libx264 @ 0x55d22a508480] Weighted P-Frames: Y:0.0% UV:0.0%\n[libx264 @ 0x55d22a508480] ref P L0: 82.1% 0.8% 14.0% 3.2%\n[libx264 @ 0x55d22a508480] ref B L0: 89.6% 9.4% 1.0%\n[libx264 @ 0x55d22a508480] ref B L1: 97.9% 2.1%\n[libx264 @ 0x55d22a508480] kb/s:72.75\n[aac @ 0x55d22a502f00] Qavg: 128.918", "metrics": { "predict_time": 3.272154, "total_time": 3.192225 }, "output": "https://replicate.delivery/pbxt/NBoZFJbtvmaEGZcpGGyKEOfSmFET395MYThgAomwIIjtc8iIA/final_waveform.mp4", "started_at": "2023-06-14T15:27:20.496105Z", "status": "succeeded", "urls": { "get": "https://api.replicate.com/v1/predictions/2n2cgzrbxnqeshbc56apt2hq7m", "cancel": "https://api.replicate.com/v1/predictions/2n2cgzrbxnqeshbc56apt2hq7m/cancel" }, "version": "89d3082a1055872f519b7f5b2b3777dfb8203cc8005da0f6cd8535d635022f54" }
Generated in/usr/local/lib/python3.9/site-packages/matplotlib/axes/_axes.py:2229: RuntimeWarning: overflow encountered in scalar add dx = [convert(x0 + ddx) - x for ddx in dx] /usr/local/lib/python3.9/site-packages/matplotlib/axes/_axes.py:2229: RuntimeWarning: overflow encountered in scalar subtract dx = [convert(x0 + ddx) - x for ddx in dx] /usr/local/lib/python3.9/site-packages/matplotlib/patches.py:739: RuntimeWarning: overflow encountered in scalar add y1 = self.convert_yunits(self._y0 + self._height) ffmpeg version 4.3.6-0+deb11u1 Copyright (c) 2000-2023 the FFmpeg developers built with gcc 10 (Debian 10.2.1-6) configuration: --prefix=/usr --extra-version=0+deb11u1 --toolchain=hardened --libdir=/usr/lib/x86_64-linux-gnu --incdir=/usr/include/x86_64-linux-gnu --arch=amd64 --enable-gpl --disable-stripping --enable-avresample --disable-filter=resample --enable-gnutls --enable-ladspa --enable-libaom --enable-libass --enable-libbluray --enable-libbs2b --enable-libcaca --enable-libcdio --enable-libcodec2 --enable-libdav1d --enable-libflite --enable-libfontconfig --enable-libfreetype --enable-libfribidi --enable-libgme --enable-libgsm --enable-libjack --enable-libmp3lame --enable-libmysofa --enable-libopenjpeg --enable-libopenmpt --enable-libopus --enable-libpulse --enable-librabbitmq --enable-librsvg --enable-librubberband --enable-libshine --enable-libsnappy --enable-libsoxr --enable-libspeex --enable-libsrt --enable-libssh --enable-libtheora --enable-libtwolame --enable-libvidstab --enable-libvorbis --enable-libvpx --enable-libwavpack --enable-libwebp --enable-libx265 --enable-libxml2 --enable-libxvid --enable-libzmq --enable-libzvbi --enable-lv2 --enable-omx --enable-openal --enable-opencl --enable-opengl --enable-sdl2 --enable-pocketsphinx --enable-libmfx --enable-libdc1394 --enable-libdrm --enable-libiec61883 --enable-chromaprint --enable-frei0r --enable-libx264 --enable-shared libavutil 56. 51.100 / 56. 51.100 libavcodec 58. 91.100 / 58. 91.100 libavformat 58. 45.100 / 58. 45.100 libavdevice 58. 10.100 / 58. 10.100 libavfilter 7. 85.100 / 7. 85.100 libavresample 4. 0. 0 / 4. 0. 0 libswscale 5. 7.100 / 5. 7.100 libswresample 3. 7.100 / 3. 7.100 libpostproc 55. 7.100 / 55. 7.100 Input #0, png_pipe, from '/tmp/tmpfig9gymu.png': Duration: N/A, bitrate: N/A Stream #0:0: Video: png, rgba(pc), 1000x200, 25 fps, 25 tbr, 25 tbn, 25 tbc Guessed Channel Layout for Input Stream #1.0 : mono Input #1, wav, from '/tmp/tmpt80hf47t20230613T093215883Z_20s_power_metal,_infectious_ha.wav': Duration: 00:00:08.00, bitrate: 512 kb/s Stream #1:0: Audio: pcm_s16le ([1][0][0][0] / 0x0001), 32000 Hz, mono, s16, 512 kb/s Stream mapping: Stream #0:0 -> #0:0 (png (native) -> h264 (libx264)) Stream #1:0 -> #0:1 (pcm_s16le (native) -> aac (native)) Press [q] to stop, [?] for help [libx264 @ 0x559b6a8cd700] using cpu capabilities: MMX2 SSE2Fast SSSE3 SSE4.2 AVX FMA3 BMI2 AVX2 [libx264 @ 0x559b6a8cd700] profile High, level 3.0, 4:2:0, 8-bit [libx264 @ 0x559b6a8cd700] 264 - core 160 r3011 cde9a93 - H.264/MPEG-4 AVC codec - Copyleft 2003-2020 - http://www.videolan.org/x264.html - options: cabac=1 ref=3 deblock=1:0:0 analyse=0x3:0x113 me=hex subme=7 psy=1 psy_rd=1.00:0.00 mixed_ref=1 me_range=16 chroma_me=1 trellis=1 8x8dct=1 cqm=0 deadzone=21,11 fast_pskip=1 chroma_qp_offset=-2 threads=6 lookahead_threads=1 sliced_threads=0 nr=0 decimate=1 interlaced=0 bluray_compat=0 constrained_intra=0 bframes=3 b_pyramid=2 b_adapt=1 b_bias=0 direct=1 weightb=1 open_gop=0 weightp=2 keyint=250 keyint_min=25 scenecut=40 intra_refresh=0 rc_lookahead=40 rc=crf mbtree=1 crf=23.0 qcomp=0.60 qpmin=0 qpmax=69 qpstep=4 ip_ratio=1.40 aq=1:1.00 Output #0, mp4, to '/tmp/tmp_v8et_m6.mp4': Metadata: encoder : Lavf58.45.100 Stream #0:0: Video: h264 (libx264) (avc1 / 0x31637661), yuv420p(progressive), 1000x200, q=-1--1, 25 fps, 12800 tbn, 25 tbc Metadata: encoder : Lavc58.91.100 libx264 Side data: cpb: bitrate max/min/avg: 0/0/0 buffer size: 0 vbv_delay: N/A Stream #0:1: Audio: aac (LC) (mp4a / 0x6134706D), 32000 Hz, mono, fltp, 69 kb/s Metadata: encoder : Lavc58.91.100 aac frame= 200 fps=0.0 q=-1.0 Lsize= 109kB time=00:00:08.00 bitrate= 111.6kbits/s speed=15.1x video:34kB audio:68kB subtitle:0kB other streams:0kB global headers:0kB muxing overhead: 6.681644% [libx264 @ 0x559b6a8cd700] frame I:1 Avg QP:20.28 size: 7953 [libx264 @ 0x559b6a8cd700] frame P:145 Avg QP:24.36 size: 156 [libx264 @ 0x559b6a8cd700] frame B:54 Avg QP:19.79 size: 68 [libx264 @ 0x559b6a8cd700] consecutive B-frames: 51.0% 36.0% 9.0% 4.0% [libx264 @ 0x559b6a8cd700] mb I I16..4: 60.0% 10.5% 29.5% [libx264 @ 0x559b6a8cd700] mb P I16..4: 1.2% 0.1% 0.2% P16..4: 0.9% 0.4% 0.1% 0.0% 0.0% skip:97.2% [libx264 @ 0x559b6a8cd700] mb B I16..4: 0.4% 0.1% 0.0% B16..8: 2.6% 0.2% 0.0% direct: 0.0% skip:96.6% L0:44.4% L1:55.1% BI: 0.5% [libx264 @ 0x559b6a8cd700] 8x8 transform intra:8.9% inter:42.6% [libx264 @ 0x559b6a8cd700] coded y,uvDC,uvAC intra: 7.9% 22.7% 18.8% inter: 0.1% 0.3% 0.2% [libx264 @ 0x559b6a8cd700] i16 v,h,dc,p: 92% 3% 6% 0% [libx264 @ 0x559b6a8cd700] i8 v,h,dc,ddl,ddr,vr,hd,vl,hu: 19% 7% 72% 0% 0% 0% 0% 0% 0% [libx264 @ 0x559b6a8cd700] i4 v,h,dc,ddl,ddr,vr,hd,vl,hu: 75% 6% 14% 0% 1% 2% 1% 1% 1% [libx264 @ 0x559b6a8cd700] i8c dc,h,v,p: 18% 4% 78% 0% [libx264 @ 0x559b6a8cd700] Weighted P-Frames: Y:0.0% UV:0.0% [libx264 @ 0x559b6a8cd700] ref P L0: 75.5% 3.1% 15.8% 5.6% [libx264 @ 0x559b6a8cd700] ref B L0: 87.5% 12.3% 0.2% [libx264 @ 0x559b6a8cd700] ref B L1: 99.9% 0.1% [libx264 @ 0x559b6a8cd700] kb/s:34.28 [aac @ 0x559b6a8cecc0] Qavg: 130.508 ffmpeg version 4.3.6-0+deb11u1 Copyright (c) 2000-2023 the FFmpeg developers built with gcc 10 (Debian 10.2.1-6) configuration: --prefix=/usr --extra-version=0+deb11u1 --toolchain=hardened --libdir=/usr/lib/x86_64-linux-gnu --incdir=/usr/include/x86_64-linux-gnu --arch=amd64 --enable-gpl --disable-stripping --enable-avresample --disable-filter=resample --enable-gnutls --enable-ladspa --enable-libaom --enable-libass --enable-libbluray --enable-libbs2b --enable-libcaca --enable-libcdio --enable-libcodec2 --enable-libdav1d --enable-libflite --enable-libfontconfig --enable-libfreetype --enable-libfribidi --enable-libgme --enable-libgsm --enable-libjack --enable-libmp3lame --enable-libmysofa --enable-libopenjpeg --enable-libopenmpt --enable-libopus --enable-libpulse --enable-librabbitmq --enable-librsvg --enable-librubberband --enable-libshine --enable-libsnappy --enable-libsoxr --enable-libspeex --enable-libsrt --enable-libssh --enable-libtheora --enable-libtwolame --enable-libvidstab --enable-libvorbis --enable-libvpx --enable-libwavpack --enable-libwebp --enable-libx265 --enable-libxml2 --enable-libxvid --enable-libzmq --enable-libzvbi --enable-lv2 --enable-omx --enable-openal --enable-opencl --enable-opengl --enable-sdl2 --enable-pocketsphinx --enable-libmfx --enable-libdc1394 --enable-libdrm --enable-libiec61883 --enable-chromaprint --enable-frei0r --enable-libx264 --enable-shared libavutil 56. 51.100 / 56. 51.100 libavcodec 58. 91.100 / 58. 91.100 libavformat 58. 45.100 / 58. 45.100 libavdevice 58. 10.100 / 58. 10.100 libavfilter 7. 85.100 / 7. 85.100 libavresample 4. 0. 0 / 4. 0. 0 libswscale 5. 7.100 / 5. 7.100 libswresample 3. 7.100 / 3. 7.100 libpostproc 55. 7.100 / 55. 7.100 Input #0, mov,mp4,m4a,3gp,3g2,mj2, from '/tmp/tmp_v8et_m6.mp4': Metadata: major_brand : isom minor_version : 512 compatible_brands: isomiso2avc1mp41 encoder : Lavf58.45.100 Duration: 00:00:08.03, start: 0.000000, bitrate: 111 kb/s Stream #0:0(und): Video: h264 (High) (avc1 / 0x31637661), yuv420p, 1000x200, 34 kb/s, 25 fps, 25 tbr, 12800 tbn, 50 tbc (default) Metadata: handler_name : VideoHandler Stream #0:1(und): Audio: aac (LC) (mp4a / 0x6134706D), 32000 Hz, mono, fltp, 69 kb/s (default) Metadata: handler_name : SoundHandler Stream mapping: Stream #0:0 -> #0:0 (h264 (native) -> h264 (libx264)) Stream #0:1 -> #0:1 (aac (native) -> aac (native)) Press [q] to stop, [?] for help [libx264 @ 0x5563d8e03f00] using cpu capabilities: MMX2 SSE2Fast SSSE3 SSE4.2 AVX FMA3 BMI2 AVX2 [libx264 @ 0x5563d8e03f00] profile High, level 3.1, 4:2:0, 8-bit [libx264 @ 0x5563d8e03f00] 264 - core 160 r3011 cde9a93 - H.264/MPEG-4 AVC codec - Copyleft 2003-2020 - http://www.videolan.org/x264.html - options: cabac=1 ref=3 deblock=1:0:0 analyse=0x3:0x113 me=hex subme=7 psy=1 psy_rd=1.00:0.00 mixed_ref=1 me_range=16 chroma_me=1 trellis=1 8x8dct=1 cqm=0 deadzone=21,11 fast_pskip=1 chroma_qp_offset=-2 threads=12 lookahead_threads=2 sliced_threads=0 nr=0 decimate=1 interlaced=0 bluray_compat=0 constrained_intra=0 bframes=3 b_pyramid=2 b_adapt=1 b_bias=0 direct=1 weightb=1 open_gop=0 weightp=2 keyint=250 keyint_min=25 scenecut=40 intra_refresh=0 rc_lookahead=40 rc=crf mbtree=1 crf=23.0 qcomp=0.60 qpmin=0 qpmax=69 qpstep=4 ip_ratio=1.40 aq=1:1.00 Output #0, mp4, to '/tmp/padded_waveform.mp4': Metadata: major_brand : isom minor_version : 512 compatible_brands: isomiso2avc1mp41 encoder : Lavf58.45.100 Stream #0:0(und): Video: h264 (libx264) (avc1 / 0x31637661), yuv420p, 1000x666, q=-1--1, 25 fps, 12800 tbn, 25 tbc (default) Metadata: handler_name : VideoHandler encoder : Lavc58.91.100 libx264 Side data: cpb: bitrate max/min/avg: 0/0/0 buffer size: 0 vbv_delay: N/A Stream #0:1(und): Audio: aac (LC) (mp4a / 0x6134706D), 32000 Hz, mono, fltp, 69 kb/s (default) Metadata: handler_name : SoundHandler encoder : Lavc58.91.100 aac frame= 178 fps=0.0 q=28.0 size= 0kB time=00:00:07.39 bitrate= 0.1kbits/s speed=14.6x frame= 200 fps=0.0 q=-1.0 Lsize= 113kB time=00:00:08.00 bitrate= 115.3kbits/s speed=12.7x video:37kB audio:68kB subtitle:0kB other streams:0kB global headers:0kB muxing overhead: 6.771026% [libx264 @ 0x5563d8e03f00] frame I:1 Avg QP:13.05 size: 8743 [libx264 @ 0x5563d8e03f00] frame P:104 Avg QP:14.62 size: 200 [libx264 @ 0x5563d8e03f00] frame B:95 Avg QP:15.87 size: 84 [libx264 @ 0x5563d8e03f00] consecutive B-frames: 25.5% 28.0% 16.5% 30.0% [libx264 @ 0x5563d8e03f00] mb I I16..4: 87.3% 2.5% 10.2% [libx264 @ 0x5563d8e03f00] mb P I16..4: 0.4% 0.0% 0.1% P16..4: 0.2% 0.1% 0.0% 0.0% 0.0% skip:99.1% [libx264 @ 0x5563d8e03f00] mb B I16..4: 0.2% 0.0% 0.0% B16..8: 0.7% 0.1% 0.0% direct: 0.0% skip:99.0% L0:42.2% L1:57.0% BI: 0.9% [libx264 @ 0x5563d8e03f00] 8x8 transform intra:4.1% inter:49.2% [libx264 @ 0x5563d8e03f00] coded y,uvDC,uvAC intra: 6.4% 16.9% 13.6% inter: 0.0% 0.1% 0.1% [libx264 @ 0x5563d8e03f00] i16 v,h,dc,p: 97% 1% 2% 0% [libx264 @ 0x5563d8e03f00] i8 v,h,dc,ddl,ddr,vr,hd,vl,hu: 30% 7% 60% 2% 0% 0% 0% 0% 1% [libx264 @ 0x5563d8e03f00] i4 v,h,dc,ddl,ddr,vr,hd,vl,hu: 76% 5% 14% 0% 1% 1% 1% 1% 0% [libx264 @ 0x5563d8e03f00] i8c dc,h,v,p: 49% 2% 48% 0% [libx264 @ 0x5563d8e03f00] Weighted P-Frames: Y:0.0% UV:0.0% [libx264 @ 0x5563d8e03f00] ref P L0: 70.5% 3.8% 16.1% 9.6% [libx264 @ 0x5563d8e03f00] ref B L0: 86.6% 11.5% 1.8% [libx264 @ 0x5563d8e03f00] ref B L1: 99.0% 1.0% [libx264 @ 0x5563d8e03f00] kb/s:37.52 [aac @ 0x5563d8e03840] Qavg: 130.633 ffmpeg version 4.3.6-0+deb11u1 Copyright (c) 2000-2023 the FFmpeg developers built with gcc 10 (Debian 10.2.1-6) configuration: --prefix=/usr --extra-version=0+deb11u1 --toolchain=hardened --libdir=/usr/lib/x86_64-linux-gnu --incdir=/usr/include/x86_64-linux-gnu --arch=amd64 --enable-gpl --disable-stripping --enable-avresample --disable-filter=resample --enable-gnutls --enable-ladspa --enable-libaom --enable-libass --enable-libbluray --enable-libbs2b --enable-libcaca --enable-libcdio --enable-libcodec2 --enable-libdav1d --enable-libflite --enable-libfontconfig --enable-libfreetype --enable-libfribidi --enable-libgme --enable-libgsm --enable-libjack --enable-libmp3lame --enable-libmysofa --enable-libopenjpeg --enable-libopenmpt --enable-libopus --enable-libpulse --enable-librabbitmq --enable-librsvg --enable-librubberband --enable-libshine --enable-libsnappy --enable-libsoxr --enable-libspeex --enable-libsrt --enable-libssh --enable-libtheora --enable-libtwolame --enable-libvidstab --enable-libvorbis --enable-libvpx --enable-libwavpack --enable-libwebp --enable-libx265 --enable-libxml2 --enable-libxvid --enable-libzmq --enable-libzvbi --enable-lv2 --enable-omx --enable-openal --enable-opencl --enable-opengl --enable-sdl2 --enable-pocketsphinx --enable-libmfx --enable-libdc1394 --enable-libdrm --enable-libiec61883 --enable-chromaprint --enable-frei0r --enable-libx264 --enable-shared libavutil 56. 51.100 / 56. 51.100 libavcodec 58. 91.100 / 58. 91.100 libavformat 58. 45.100 / 58. 45.100 libavdevice 58. 10.100 / 58. 10.100 libavfilter 7. 85.100 / 7. 85.100 libavresample 4. 0. 0 / 4. 0. 0 libswscale 5. 7.100 / 5. 7.100 libswresample 3. 7.100 / 3. 7.100 libpostproc 55. 7.100 / 55. 7.100 Input #0, mov,mp4,m4a,3gp,3g2,mj2, from '/tmp/padded_waveform.mp4': Metadata: major_brand : isom minor_version : 512 compatible_brands: isomiso2avc1mp41 encoder : Lavf58.45.100 Duration: 00:00:08.03, start: 0.000000, bitrate: 114 kb/s Stream #0:0(und): Video: h264 (High) (avc1 / 0x31637661), yuv420p, 1000x666, 38 kb/s, 25 fps, 25 tbr, 12800 tbn, 50 tbc (default) Metadata: handler_name : VideoHandler Stream #0:1(und): Audio: aac (LC) (mp4a / 0x6134706D), 32000 Hz, mono, fltp, 69 kb/s (default) Metadata: handler_name : SoundHandler Input #1, png_pipe, from '/tmp/background.png': Duration: N/A, bitrate: N/A Stream #1:0: Video: png, rgb48be(pc), 980x447, 25 tbr, 25 tbn, 25 tbc Stream mapping: Stream #0:0 (h264) -> overlay:main (graph 0) Stream #1:0 (png) -> overlay:overlay (graph 0) overlay (graph 0) -> Stream #0:0 (libx264) Stream #0:1 -> #0:1 (aac (native) -> aac (native)) Press [q] to stop, [?] for help [libx264 @ 0x55d22a508480] using cpu capabilities: MMX2 SSE2Fast SSSE3 SSE4.2 AVX FMA3 BMI2 AVX2 [libx264 @ 0x55d22a508480] profile High, level 3.1, 4:2:0, 8-bit [libx264 @ 0x55d22a508480] 264 - core 160 r3011 cde9a93 - H.264/MPEG-4 AVC codec - Copyleft 2003-2020 - http://www.videolan.org/x264.html - options: cabac=1 ref=3 deblock=1:0:0 analyse=0x3:0x113 me=hex subme=7 psy=1 psy_rd=1.00:0.00 mixed_ref=1 me_range=16 chroma_me=1 trellis=1 8x8dct=1 cqm=0 deadzone=21,11 fast_pskip=1 chroma_qp_offset=-2 threads=12 lookahead_threads=2 sliced_threads=0 nr=0 decimate=1 interlaced=0 bluray_compat=0 constrained_intra=0 bframes=3 b_pyramid=2 b_adapt=1 b_bias=0 direct=1 weightb=1 open_gop=0 weightp=2 keyint=250 keyint_min=25 scenecut=40 intra_refresh=0 rc_lookahead=40 rc=crf mbtree=1 crf=23.0 qcomp=0.60 qpmin=0 qpmax=69 qpstep=4 ip_ratio=1.40 aq=1:1.00 Output #0, mp4, to '/tmp/final_waveform.mp4': Metadata: major_brand : isom minor_version : 512 compatible_brands: isomiso2avc1mp41 encoder : Lavf58.45.100 Stream #0:0: Video: h264 (libx264) (avc1 / 0x31637661), yuv420p, 1000x666, q=-1--1, 25 fps, 12800 tbn, 25 tbc (default) Metadata: encoder : Lavc58.91.100 libx264 Side data: cpb: bitrate max/min/avg: 0/0/0 buffer size: 0 vbv_delay: N/A Stream #0:1(und): Audio: aac (LC) (mp4a / 0x6134706D), 32000 Hz, mono, fltp, 69 kb/s (default) Metadata: handler_name : SoundHandler encoder : Lavc58.91.100 aac frame= 133 fps=0.0 q=28.0 size= 0kB time=00:00:05.56 bitrate= 0.1kbits/s speed=11.1x frame= 200 fps=0.0 q=-1.0 Lsize= 148kB time=00:00:08.00 bitrate= 151.1kbits/s speed=8.59x video:72kB audio:69kB subtitle:0kB other streams:0kB global headers:0kB muxing overhead: 5.107691% [libx264 @ 0x55d22a508480] frame I:1 Avg QP:18.16 size: 40130 [libx264 @ 0x55d22a508480] frame P:108 Avg QP:19.99 size: 232 [libx264 @ 0x55d22a508480] frame B:91 Avg QP:21.80 size: 83 [libx264 @ 0x55d22a508480] consecutive B-frames: 26.5% 35.0% 10.5% 28.0% [libx264 @ 0x55d22a508480] mb I I16..4: 69.5% 1.7% 28.7% [libx264 @ 0x55d22a508480] mb P I16..4: 0.5% 0.1% 0.1% P16..4: 1.3% 0.1% 0.0% 0.0% 0.0% skip:97.9% [libx264 @ 0x55d22a508480] mb B I16..4: 0.1% 0.0% 0.0% B16..8: 0.9% 0.1% 0.0% direct: 0.0% skip:98.9% L0:44.1% L1:55.2% BI: 0.8% [libx264 @ 0x55d22a508480] 8x8 transform intra:6.5% inter:72.3% [libx264 @ 0x55d22a508480] coded y,uvDC,uvAC intra: 12.9% 30.7% 23.4% inter: 0.2% 0.6% 0.1% [libx264 @ 0x55d22a508480] i16 v,h,dc,p: 94% 4% 2% 0% [libx264 @ 0x55d22a508480] i8 v,h,dc,ddl,ddr,vr,hd,vl,hu: 12% 4% 82% 1% 0% 0% 0% 0% 0% [libx264 @ 0x55d22a508480] i4 v,h,dc,ddl,ddr,vr,hd,vl,hu: 62% 12% 13% 2% 2% 3% 3% 2% 2% [libx264 @ 0x55d22a508480] i8c dc,h,v,p: 40% 9% 50% 1% [libx264 @ 0x55d22a508480] Weighted P-Frames: Y:0.0% UV:0.0% [libx264 @ 0x55d22a508480] ref P L0: 82.1% 0.8% 14.0% 3.2% [libx264 @ 0x55d22a508480] ref B L0: 89.6% 9.4% 1.0% [libx264 @ 0x55d22a508480] ref B L1: 97.9% 2.1% [libx264 @ 0x55d22a508480] kb/s:72.75 [aac @ 0x55d22a502f00] Qavg: 128.918
Want to make some of these yourself?
Run this model