Moozonian

About 912 results
AI Overview
Generating...
Sponsored • AdSense Integration Active
💡
Did you mean: radio
Corrected by Entity Network
arxiv.org
arxiv.org › abs › 2311.07919v2
Qwen-Audio: Advancing Universal Audio Understanding via Unified Large-Scale Audio-Language Models
Recently, instruction-following audio-language models have received broad attention for audio interaction with humans. However, the absence of pre-trained audio models capable of handling diverse audi...
arxiv.org
arxiv.org › abs › 2311.08396v1
Zero-shot audio captioning with audio-language model guidance and audio context keywords
Zero-shot audio captioning aims at automatically generating descriptive textual captions for audio content without prior training for this task. Different from speech recognition which translates audi...
www.amazon.com
amazon.com › dp › B0F8...quire_auto-append-20
Amazon.com: Victrola Wave – Bluetooth Turntable with Auracast – 2-Speed Vinyl Record Player, Audio Technica AT-VM95E Cartridge, Hi-Res aptX HD and Adaptive Bluetooth Streaming, Auracast Broadcast Audio (White) : Electronics
Buy Victrola Wave – Bluetooth Turntable with Auracast – 2-Speed Vinyl Record Player, Audio Technica AT-VM95E Cartridge, Hi-Res aptX HD and Adaptive Bluetooth Streaming, Auracast Broadcast Audio (W...
Sponsored • AdSense Integration Active
www.reddit.com
reddit.com › r › windo..._on_windows_11_with ›
Extending Bluetooth® LE Audio on Windows 11 with shared audio (preview)
Today’s Windows 11 Insider Preview Build (26220.7051) for Dev & Beta Channels begins gradual rollout of **shared audio (preview)**, a new experience being previewed that allows your audio to be...
arxiv.org
arxiv.org › abs › 2309.09836v2
RECAP: Retrieval-Augmented Audio Captioning
We present RECAP (REtrieval-Augmented Audio CAPtioning), a novel and effective audio captioning system that generates captions conditioned on an input audio and other captions similar to the audio ret...
arxiv.org
arxiv.org › abs › 1708.07218v1
Object-Based Audio Rendering
Apparatus and methods are disclosed for performing object-based audio rendering on a plurality of audio objects which define a sound scene, each audio object comprising at least one audio signal and a...
arxiv.org
arxiv.org › abs › 2401.08902v1
Similar but Faster: Manipulation of Tempo in Music Audio Embeddings for Tempo Prediction and Search
Audio embeddings enable large scale comparisons of the similarity of audio files for applications such as search and recommendation. Due to the subjectivity of audio similarity, it can be desirable to...
arxiv.org
arxiv.org › abs › 1810.13137v4
Introducing SPAIN (SParse Audio INpainter)
A novel sparsity-based algorithm for audio inpainting is proposed. It is an adaptation of the SPADE algorithm by Kitić et al., originally developed for audio declipping, to the task of audio inpainti...
arxiv.org
arxiv.org › abs › 2507.16632v3
Step-Audio 2 Technical Report
This paper presents Step-Audio 2, an end-to-end multi-modal large language model designed for industry-strength audio understanding and speech conversation. By integrating a latent audio encoder and r...
arxiv.org
arxiv.org › abs › 2202.10910v1
Sound Adversarial Audio-Visual Navigation
Audio-visual navigation task requires an agent to find a sound source in a realistic, unmapped 3D environment by utilizing egocentric audio-visual observations. Existing audio-visual navigation works ...
arxiv.org
arxiv.org › abs › 2505.20166v3
From Alignment to Advancement: Bootstrapping Audio-Language Alignment with Synthetic Data
Audio-aware large language models (ALLMs) have recently made great strides in understanding and processing audio inputs. These models are typically adapted from text-based large language models (LLMs)...
arxiv.org
arxiv.org › abs › 2409.12962v2
CLAIR-A: Leveraging Large Language Models to Judge Audio Captions
The Automated Audio Captioning (AAC) task asks models to generate natural language descriptions of an audio input. Evaluating these machine-generated audio captions is a complex task that requires con...
arxiv.org
arxiv.org › abs › 2101.00132v1
Audio Content Analysis
Preprint for a book chapter introducing Audio Content Analysis. With a focus on Music Information Retrieval systems, this chapter defines musical audio content, introduces the general process of audio...
arxiv.org
arxiv.org › abs › 2104.11568v1
The Influence of Audio on Video Memorability with an Audio Gestalt Regulated Video Memorability System
Memories are the tethering threads that tie us to the world, and memorability is the measure of their tensile strength. The threads of memory are spun from fibres of many modalities, obscuring the con...
www.reddit.com
reddit.com › r › Retro...and_rsound_as_audio ›
How come I only see OpenSL and RSound as audio drivers in Android? Are these the only default audio drivers in Android?
On Android I remember there was a way to use audiotrack (or was it aaudio?) as well. Where did it go?...
blog.google
blog.google › products › a...dio-auracast-support
LE Audio Auracast support expands to more Android devices
We’re announcing LE Audio Auracast support on more phones and headphones, and introducing a new way to share audio to multiple headphones.
store.google.com
store.google.com › us › pr...tm_campaign=GS107430
Nest Audio, Amazing sound at your command - Google Store
Nest Audio is a premium smart speaker providing whole home audio in a compact system.
www.wikidata.org
wikidata.org › wiki › Q48996222
generative audio - Wikidata
creation of audio files from databases of audio clips
www.bing.com
bing.com › ck › a?!&am...AvODE5MDU3&ntb=1
[DRIVERS] Realtek Audio (AMD 3xx/4xx/5xx/6xx/8xx &... - Republic of ...
Apr 20, 2020 · ASUS ROG / TUF / PRIME /ProArt Realtek motherboards : Install/Update Process : CLEANUP /!\ If you already had Realtek (HD) Audio Driver, Realtek Audio Control/Console installed …
arxiv.org
arxiv.org › abs › 2206.06108v3
Language-based Audio Retrieval Task in DCASE 2022 Challenge
Language-based audio retrieval is a task, where natural language textual captions are used as queries to retrieve audio signals from a dataset. It has been first introduced into DCASE 2022 Challenge a...
en.wikipedia.org Wikipedia
en.wikipedia.org › wiki › Audio
Audio - Wikipedia
up audio in Wiktionary, the free dictionary. Audio most commonly refers to sound, as it is transmitted in signal form. It may also refer to: Audio signal
www.reddit.com Reddit
reddit.com › r › EndDe... › 1p3k7q2 › audio_porn ›
Audio porn?
I’m committed to the 15 min rule but I struggle to stay aroused the entire time as I wean myself off porn. Is there any middle ground that works for people like audio porn or smut? I’m trying to b...
github.com GitHub
github.com › pytorch › audio
pytorch/audio
Data manipulation and transformation for audio signal processing, powered by PyTorch (⭐ 2834)
9to5mac.com HackerNews
9to5mac.com › 2019 › 01 › 28 › facetime-bug-hear-audio ›
FaceTime bug lets you hear audio of person you are calling before they pick up
Points: 1534 | Comments: 429 | Author: uptown
arxiv.org arXiv
arxiv.org › abs › 2305.15266v3
Diffusion-Based Audio Inpainting
Audio inpainting aims to reconstruct missing segments in corrupted recordings. Most of existing methods produce plausible reconstructions when the gap lengths are short, but struggle to reconstruct ga...
en.wikipedia.org Wikipedia
en.wikipedia.org › wiki › MP3
MP3 - Wikipedia
MP3 (formally MPEG-1 Audio Layer III or MPEG-2 Audio Layer III) is an audio coding format developed largely by the Fraunhofer Society in Germany under
www.reddit.com Reddit
reddit.com › r › relat...ornerotic_audio_why ›
My wife listening to audio porn/erotic audio. Why am I having a hard time moving on?
Our van does this annoying thing where Bluetooth audio will disconnect from one of our phones then connect to the other's phone. My wife (36 F) and I (41 M) were driving with our three kids as well as...
github.com GitHub
github.com › PaulStoffregen › Audio
PaulStoffregen/Audio
Teensy Audio Library (⭐ 1214)
hacks.mozilla.org HackerNews
hacks.mozilla.org › 2019...ble-video-and-audio ›
Firefox 66 to block automatically playing audible video and audio
Points: 1415 | Comments: 368 | Author: mfsch
arxiv.org arXiv
arxiv.org › abs › 1908.02590v3
Audio-visual Speech Enhancement Using Conditional Variational Auto-Encoders
Variational auto-encoders (VAEs) are deep generative latent variable models that can be used for learning the distribution of complex data. VAEs have been successfully used to learn a probabilistic pr...
en.wikipedia.org Wikipedia
en.wikipedia.org › wiki › Audio_crossover
Audio crossover - Wikipedia
Audio crossovers are a type of electronic filter circuitry that splits an audio signal into two or more frequency ranges, so that the signals can be sent
www.reddit.com Reddit
reddit.com › r › Dasha...uz05 › audio_original ›
Audio original
...
github.com GitHub
github.com › AudioKit › AudioKit
AudioKit/AudioKit
Audio synthesis, processing, & analysis platform for iOS, macOS and tvOS (⭐ 11287)
arxiv.org arXiv
arxiv.org › abs › 2411.18222v1
Towards Improved Objective Perceptual Audio Quality Assessment -- Part 1: A Novel Data-Driven Cognitive Model
Efficient audio quality assessment is vital for streamlining audio codec development. Objective assessment tools have been developed over time to algorithmically predict quality ratings from subjectiv...
en.wikipedia.org Wikipedia
en.wikipedia.org › wiki › Audio_normalization
Audio normalization - Wikipedia
Audio normalization is the application of a constant amount of gain to an audio recording to bring the amplitude to a target level (the norm). Because
www.reddit.com Reddit
reddit.com › r › ldsse...vliqt › audio_erotica ›
Audio Erotica
I know this is borderline for many here, but I am kind of having a hard time not liking it. I am against watching porn or my husband watching porn as its just too graphic and bothers me a lot with the...
github.com GitHub
github.com › AIGC-Audio › AudioGPT
AIGC-Audio/AudioGPT
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head (⭐ 10207)
arxiv.org arXiv
arxiv.org › abs › 2505.20166v3
From Alignment to Advancement: Bootstrapping Audio-Language Alignment with Synthetic Data
Audio-aware large language models (ALLMs) have recently made great strides in understanding and processing audio inputs. These models are typically adapted from text-based large language models (LLMs)...
en.wikipedia.org Wikipedia
en.wikipedia.org › wiki › Compact_disc
Compact disc - Wikipedia
play digital audio recordings. It employs the Compact Disc Digital Audio (CD-DA) standard and is capable of holding uncompressed stereo audio. First released
www.reddit.com Reddit
reddit.com › r › TikTo...6e › whimpering_audio ›
Whimpering Audio
...
github.com GitHub
github.com › facebookresearch › audiocraft
facebookresearch/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable mu...
arxiv.org arXiv
arxiv.org › abs › 2105.01531v2
VQCPC-GAN: Variable-Length Adversarial Audio Synthesis Using Vector-Quantized Contrastive Predictive Coding
Influenced by the field of Computer Vision, Generative Adversarial Networks (GANs) are often adopted for the audio domain using fixed-size two-dimensional spectrogram representations as the "image dat...
en.wikipedia.org Wikipedia
en.wikipedia.org › wiki › Cassette_tape
Cassette tape - Wikipedia
called Compact Cassette, audio cassette, or simply tape or cassette, is an analog magnetic tape recording format for audio recording and playback. Invented
www.reddit.com Reddit
reddit.com › r › inter...n_beans_and_calcium ›
Giant snails eating green beans and calcium powder with high-quality audio.
...
github.com GitHub
github.com › audacity › audacity
audacity/audacity
Audio Editor (⭐ 16566)
arxiv.org arXiv
arxiv.org › abs › 2501.04116v3
dCoNNear: An Artifact-Free Neural Network Architecture for Closed-loop Audio Signal Processing
Recent advances in deep neural networks (DNNs) have significantly improved various audio processing applications, including speech enhancement, synthesis, and hearing-aid algorithms. DNN-based closed-...
en.wikipedia.org Wikipedia
en.wikipedia.org › wiki › Audio_mining
Audio mining - Wikipedia
Audio mining is a technique by which the content of an audio signal can be automatically analyzed and searched. It is most commonly used in the field of
www.reddit.com Reddit
reddit.com › r › Under...terfering_with_2020 ›
Leaked audio of Trump interfering with 2020 election result
Source: https://www.youtube.com/watch?v=-mPQsG_TJb4 https://en.wikipedia.org/wiki/Trump%E2%80%93Raffensperger_phone_call...
github.com GitHub
github.com › mattgallagher › AudioStreamer
mattgallagher/AudioStreamer
A streaming audio player class (AudioStreamer) for Mac OS X and iPhone. (⭐ 1943)
arxiv.org arXiv
arxiv.org › abs › 2311.08396v1
Zero-shot audio captioning with audio-language model guidance and audio context keywords
Zero-shot audio captioning aims at automatically generating descriptive textual captions for audio content without prior training for this task. Different from speech recognition which translates audi...
en.wikipedia.org Wikipedia
en.wikipedia.org › wiki › Audio_restoration
Audio restoration - Wikipedia
Audio restoration is the process of removing imperfections (such as hiss, impulse noise, crackle, wow and flutter, background noise, and mains hum) from
www.reddit.com Reddit
reddit.com › r › self › ...ng_about_barging_in ›
Played Audio of Trump Bragging About Barging In On Naked Underage Girls Tonight
Don’t know where else to post this… So, my mom - who I love - has a real hate towards Hunter Biden and a bunch of others. Ok, I get it, sometimes we dislike the other team. Tonight we were chat...
github.com GitHub
github.com › advplyr › audiobookshelf
advplyr/audiobookshelf
Self-hosted audiobook and podcast server (⭐ 11918)
arxiv.org arXiv
arxiv.org › abs › 2102.01243v3
PSLA: Improving Audio Tagging with Pretraining, Sampling, Labeling, and Aggregation
Audio tagging is an active research area and has a wide range of applications. Since the release of AudioSet, great progress has been made in advancing model performance, which mostly comes from the d...
en.wikipedia.org Wikipedia
en.wikipedia.org › wiki › IPA_consonant_chart_with_audio
IPA consonant chart with audio - Wikipedia
This article includes inline links to audio files. If you have trouble playing the files, see Wikipedia Media help. This article contains phonetic transcriptions
www.reddit.com Reddit
reddit.com › r › polit...w_cuomo_counting_on ›
Stunning Audio Reveals Andrew Cuomo Counting on Trump to Help Him Win
...
github.com GitHub
github.com › DrewThomasson › ebook2audiobook
DrewThomasson/ebook2audiobook
Generate audiobooks from e-books, voice cloning & 1158+ languages! (⭐ 18344)
arxiv.org arXiv
arxiv.org › abs › 2311.07919v2
Qwen-Audio: Advancing Universal Audio Understanding via Unified Large-Scale Audio-Language Models
Recently, instruction-following audio-language models have received broad attention for audio interaction with humans. However, the absence of pre-trained audio models capable of handling diverse audi...
en.wikipedia.org Wikipedia
en.wikipedia.org › wiki › Audio_compression
Audio compression - Wikipedia
Audio compression may refer to: Audio compression (data), a type of lossy or lossless compression in which the amount of data in a recorded waveform is
www.reddit.com Reddit
reddit.com › r › singu...ut_it_feels_so_real ›
Both video and audio is AI but it feels so real
...
github.com GitHub
github.com › katspaugh › wavesurfer.js
katspaugh/wavesurfer.js
Audio waveform player (⭐ 10135)
arxiv.org arXiv
arxiv.org › abs › 2101.00132v1
Audio Content Analysis
Preprint for a book chapter introducing Audio Content Analysis. With a focus on Music Information Retrieval systems, this chapter defines musical audio content, introduces the general process of audio...