Skip to content

Ultimate Yomitan Audio Source

Tired of this sound?

I have created an audio source that uses several techniques to ensure you always get high quality audio for your Yomitan cards. Here is what it can do:

Match alternative forms

A lot of words have several alternative forms. Other audio sources might not provide valid audio for these alternative forms. Take these forms of the verb "to hit" for example:

打぀ 拍぀ 搏぀ 擣぀ 撲぀

All of these verbs share the same reading う぀ (utsu) and have the same pitch accent pattern. Using the Ultimate Yomitan Audio Source you'll get the correct audio regardless of which kanji form is used.

Cascading match priority

Let's assume you're searching for a really rare word. Existing audio sources might not have an entry for it and not give you any audio. The Ultimate Yomitan Audio Source will cascade through the following match priorities instead:

  1. It prioritizes matching the expression and the reading
  2. Subsequently, it will match the expression only
  3. Finally it will match the reading only
  4. Additionally you can get TTS audio (see below for more information)

This ensures you get the best possible audio for your card. The only thing you have to do is ensure the pitch accent matches, which you can do by examining the available results:

耆旧    

In this example we can observe that we have a 平板 (heiban) pitch accent pattern. This matches with the result we get:

We can thus confidently use this audio for our card.

TTS fallback with nice sounding AI voice

Often, we want to mine vocabulary in highly specific areas, such as names of celebrities, which might not have readily available audio but are relevant to us. We use a high quality AI voice TTS fallback for these cases. You can be confident you will ALWAYS get audio. Take these names for example:

εͺ内逍ι₯  εΉ³ε…Όη››

These names are pronounced perfectly by the TTS engine.

TTS pitch accent awareness

The TTS references known pitch accent patterns for words with multiple pitch accents. You can individually request the pitch accent pattern you want if it's available. For example:

毎ζ—₯  標瀺

If no pitch accent pattern is available the TTS will take a best guess.

No reading - No problem

Some dictionaries have entries without any reading at all. The Ultimate Yomitan Audio Source can handle these entries by making an educated guess at the reading. For example:

δΈ€ζ–Ήηš„γ«

This exists as an entry in ζ–°ε’Œθ‹± without a reading.

πŸ“¦ Installation

Access is available for all users signed up to the $1 tier on my Patreon. No add-on installation is required. Be aware that usage limits apply, but you should not reach them in normal usage.

  1. Subscribe on Patreon β†’ Get your personal API key by authenticating here.

  2. Open Yomitan settings β†’ Navigate to Audio sources β†’ Add a new audio source with the following details:

    Type: Custom URL (JSON)

    URL: The link you were provided with after authentication

  3. Test It Out:

    • Try looking up a word to confirm audio is working properly
    • Enjoy high-quality audio for all your cards!

β†’ SIGN UP ON PATREON NOW ←

A note on batch generation

The Ultimate Yomitan Audio Source can be used in batch generation scripts and does some clean-up of the input parameters that might break other audio sources -- However, if you intend to perform batch generation, it remains your responsibility to ensure the correctness of the input parameters and the resulting audio.

When using the Generate Batch Audio Add-on for Anki add-on, set it up as follows:

Batch Processing Setup

⚠️ Important:

  • Ensure you're putting in the correct field values:
    • ?term= should match your word field (without any extras like fancy HTML)
    • ?reading= should match your reading field (without any extras like fancy HTML)
  • Put in a delay of 0,2 to avoid getting rate limited.
  • Warning: As getting high fidelity AI TTS is expensive you may run into your API limit when generating a very large number of cards.

Size Comparison

This is currently the biggest word audio database in existence. Here is an overview:

Audio Source Entries
Yomitan Ultimate Audio Source 838078 + TTS Fallback
Yomichan Audio Server Entries (Rust Server) 732607 (without Chinese)
Local Audio Yomichan (Anki Add-on) 590410