FFMpeg 获取音频音量、提高音量

查看音量

准备原生音频original.mp3

查看original.mp3的音量信息:

bash 复制代码
ffmpeg -i original.mp3 -filter_complex volumedetect -c:v copy -f null /dev/null

输出:

bash 复制代码
Input #0, mp3, from 'original.mp3':
  Metadata:
    artist          : Administrator
    encoder         : Studio One 6.0.1.90430
    TBPM            : 120
    date            : 2023
  Duration: 00:00:26.41, start: 0.025057, bitrate: 320 kb/s
  Stream #0:0: Audio: mp3, 44100 Hz, stereo, fltp, 320 kb/s
    Metadata:
      encoder         : LAME3.100
[Parsed_volumedetect_0 @ 0x55d0ba56dac0] n_samples: 0
Stream mapping:
  Stream #0:0 (mp3float) -> volumedetect:default
  volumedetect:default -> Stream #0:0 (pcm_s16le)
Press [q] to stop, [?] for help
Output #0, null, to '/dev/null':
  Metadata:
    artist          : Administrator
    date            : 2023
    TBPM            : 120
    encoder         : Lavf60.3.100
  Stream #0:0: Audio: pcm_s16le, 44100 Hz, stereo, s16, 1411 kb/s
    Metadata:
      encoder         : Lavc60.3.100 pcm_s16le
size=N/A time=00:00:26.35 bitrate=N/A speed= 684x    A    
video:0kB audio:4544kB subtitle:0kB other streams:0kB global headers:0kB muxing overhead: unknown
[Parsed_volumedetect_0 @ 0x55d0ba59c280] n_samples: 2326514
[Parsed_volumedetect_0 @ 0x55d0ba59c280] mean_volume: -35.5 dB
[Parsed_volumedetect_0 @ 0x55d0ba59c280] max_volume: -13.8 dB
[Parsed_volumedetect_0 @ 0x55d0ba59c280] histogram_13db: 8
[Parsed_volumedetect_0 @ 0x55d0ba59c280] histogram_14db: 99
[Parsed_volumedetect_0 @ 0x55d0ba59c280] histogram_15db: 189
[Parsed_volumedetect_0 @ 0x55d0ba59c280] histogram_16db: 191
[Parsed_volumedetect_0 @ 0x55d0ba59c280] histogram_17db: 273
[Parsed_volumedetect_0 @ 0x55d0ba59c280] histogram_18db: 596
[Parsed_volumedetect_0 @ 0x55d0ba59c280] histogram_19db: 1003

音量平均值:mean_volume: -35.5 dB

音量最大值:max_volume: -13.8 dB

提高音量

bash 复制代码
ffmpeg -i original.mp3 -af "volume=3.0,highpass=f=200,lowpass=f=3000,afftdn=nr=50" original.volume=3.0,highpass=f=200,lowpass=f=3000,afftdn=nr=50.wav

参数简单说明下:

volume:提供响度,3.0表示提高为原来的3倍;

hightpass:高通滤波器;

lowpass:低通滤波器;

afftdn:使用FFT对音频样本进行降噪

输出:

bash 复制代码
Input #0, mp3, from 'original.mp3':
  Metadata:
    artist          : Administrator
    encoder         : Studio One 6.0.1.90430
    TBPM            : 120
    date            : 2023
  Duration: 00:00:26.41, start: 0.025057, bitrate: 320 kb/s
  Stream #0:0: Audio: mp3, 44100 Hz, stereo, fltp, 320 kb/s
    Metadata:
      encoder         : LAME3.100
Stream mapping:
  Stream #0:0 -> #0:0 (mp3 (mp3float) -> pcm_s16le (native))
Press [q] to stop, [?] for help
Output #0, wav, to 'original.volume=3.0,highpass=f=200,lowpass=f=3000,afftdn=nr=50.wav':
  Metadata:
    IART            : Administrator
    ICRD            : 2023
    TBPM            : 120
    ISFT            : Lavf60.3.100
  Stream #0:0: Audio: pcm_s16le ([1][0][0][0] / 0x0001), 44100 Hz, stereo, s16, 1411 kb/s
    Metadata:
      encoder         : Lavc60.3.100 pcm_s16le
size=    4544kB time=00:00:26.37 bitrate=1411.4kbits/s speed= 165x       
video:0kB audio:4544kB subtitle:0kB other streams:0kB global headers:0kB muxing overhead: 0.002450%

在提高音量后,再看看音量信息:

bash 复制代码
ffmpeg -i original.volume\=3.0\,highpass\=f\=200\,lowpass\=f\=3000\,afftdn\=nr\=50.wav -filter_complex volumedetect -c:v copy -f null /dev/null

输出:

bash 复制代码
Input #0, wav, from 'original.volume=3.0,highpass=f=200,lowpass=f=3000,afftdn=nr=50.wav':
  Metadata:
    artist          : Administrator
    date            : 2023
    encoder         : Lavf60.3.100
  Duration: 00:00:26.38, bitrate: 1411 kb/s
  Stream #0:0: Audio: pcm_s16le ([1][0][0][0] / 0x0001), 44100 Hz, 2 channels, s16, 1411 kb/s
[Parsed_volumedetect_0 @ 0x55819c4a4200] n_samples: 0
Stream mapping:
  Stream #0:0 (pcm_s16le) -> volumedetect:default
  volumedetect:default -> Stream #0:0 (pcm_s16le)
Press [q] to stop, [?] for help
Output #0, null, to '/dev/null':
  Metadata:
    artist          : Administrator
    date            : 2023
    encoder         : Lavf60.3.100
  Stream #0:0: Audio: pcm_s16le, 44100 Hz, stereo, s16, 1411 kb/s
    Metadata:
      encoder         : Lavc60.3.100 pcm_s16le
size=N/A time=00:00:26.35 bitrate=N/A speed=1.88e+03x     
video:0kB audio:4544kB subtitle:0kB other streams:0kB global headers:0kB muxing overhead: unknown
[Parsed_volumedetect_0 @ 0x55819c4c35c0] n_samples: 2326514
[Parsed_volumedetect_0 @ 0x55819c4c35c0] mean_volume: -33.3 dB
[Parsed_volumedetect_0 @ 0x55819c4c35c0] max_volume: -7.7 dB
[Parsed_volumedetect_0 @ 0x55819c4c35c0] histogram_7db: 6
[Parsed_volumedetect_0 @ 0x55819c4c35c0] histogram_8db: 18
[Parsed_volumedetect_0 @ 0x55819c4c35c0] histogram_9db: 63
[Parsed_volumedetect_0 @ 0x55819c4c35c0] histogram_10db: 78
[Parsed_volumedetect_0 @ 0x55819c4c35c0] histogram_11db: 123
[Parsed_volumedetect_0 @ 0x55819c4c35c0] histogram_12db: 236
[Parsed_volumedetect_0 @ 0x55819c4c35c0] histogram_13db: 324
[Parsed_volumedetect_0 @ 0x55819c4c35c0] histogram_14db: 474
[Parsed_volumedetect_0 @ 0x55819c4c35c0] histogram_15db: 732
[Parsed_volumedetect_0 @ 0x55819c4c35c0] histogram_16db: 1144

音量平均值:mean_volume: -33.3 dB

音量最大值:max_volume: -7.7 dB

相关推荐
音视频牛哥6 小时前
SmartMediaKit:如何让智能系统早人一步“跟上现实”的时间架构--从实时流媒体到系统智能的演进
人工智能·计算机视觉·音视频·音视频开发·具身智能·十五五规划具身智能·smartmediakit
音视频牛哥7 小时前
超清≠清晰:视频系统里的分辨率陷阱与秩序真相
人工智能·机器学习·计算机视觉·音视频·大牛直播sdk·rtsp播放器rtmp播放器·smartmediakit
johnny2337 小时前
AI视频创作工具汇总:MoneyPrinterTurbo、KrillinAI、NarratoAI、ViMax
人工智能·音视频
EasyCVR11 小时前
视频融合平台EasyCVR级联失败问题排查:请求上级播放后,视频为何无法打开?
音视频
ACP广源盛1392462567312 小时前
(ACP广源盛)GSV2231---DisplayPort 1.4 MST 到 HDMI 2.0/DP/Type-C 转换器(带嵌入式 MCU)
c语言·开发语言·单片机·嵌入式硬件·音视频·mst
范纹杉想快点毕业13 小时前
12个月嵌入式进阶计划ZYNQ 系列芯片嵌入式与硬件系统知识学习全计划(基于国内视频资源)
c语言·arm开发·单片机·嵌入式硬件·学习·fpga开发·音视频
Hody9117 小时前
【XR技术介绍】空间音频(Spatial Audio):原理是什么?如何让声音听起来像是从你身后传来的?
音视频·xr
jiushun_suanli18 小时前
AI生成音频:技术概述与实践指南
人工智能·经验分享·音视频
地狱为王18 小时前
Unity使用RVM实现实时人物视频抠像(无绿幕)
unity·游戏引擎·音视频
我科绝伦(Huanhuan Zhou)18 小时前
Oracle AWR管理与快照操作完整指南
数据库·oracle·ffmpeg