Ultimate Yomitan Audio Source

Tired of this sound?

I have created the biggest audio source for Yomitan. It has the largest collection of human pronunciations and uses several techniques to ensure you always get high-quality audio for your Yomitan cards. Here is what it can do:

Match alternative forms

A lot of words have several alternative forms. Other audio sources might not provide valid audio for these alternative forms. Take these forms of the verb "to hit" for example:

打つ　拍つ　搏つ　擣つ　撲つ

All of these verbs share the same reading うつ (utsu) and have the same pitch accent pattern. Using the Ultimate Yomitan Audio Source you'll get the correct audio regardless of which kanji form is used.

Cascading match priority

Let's assume you're searching for a really rare word. Existing audio sources might have no entry for it and give you no audio at all. The Ultimate Yomitan Audio Source will cascade through the following match priorities instead:

1. It prioritizes matching the expression and the reading
2. Subsequently, it will match the expression only
3. Finally, it will match the reading only
4. Additionally, you can get TTS audio (see below for more information)
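The priorities above can be sketched roughly as follows. This is only an illustration of the cascade idea, not the actual server code: the in-memory `AUDIO_INDEX`, the file names, the `find_audio` function, and the choice to return every tier's results in order are all assumptions made for the example.

```python
# Hypothetical in-memory index; the real source's storage is not described here.
AUDIO_INDEX = [
    {"term": "打つ", "reading": "うつ", "file": "utsu_human.mp3"},
    {"term": "毎日", "reading": "まいにち", "file": "mainichi_human.mp3"},
]


def find_audio(term: str, reading: str) -> list[tuple[str, str]]:
    """Return (match_kind, file) pairs ordered from best to weakest match."""
    tiers = [
        ("term+reading", lambda e: e["term"] == term and e["reading"] == reading),
        ("term only", lambda e: e["term"] == term),
        ("reading only", lambda e: e["reading"] == reading),
    ]
    results = []
    seen = set()
    for kind, matches in tiers:
        for entry in AUDIO_INDEX:
            # Report each file once, under the strongest tier that matched it.
            if matches(entry) and entry["file"] not in seen:
                seen.add(entry["file"])
                results.append((kind, entry["file"]))
    # TTS comes last so that human recordings are always listed first.
    results.append(("tts", f"tts:{reading or term}"))
    return results


if __name__ == "__main__":
    # 拍つ is not in the index, so only the reading-only tier and TTS respond.
    for kind, file in find_audio("拍つ", "うつ"):
        print(kind, file)
```

Because a reading-only match can belong to a different word with a different pitch accent, the match kind is kept next to each result so it can be checked before the audio is used.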

This ensures you get the best possible audio for your card. The only thing you have to do is ensure the pitch accent matches, which you can do by examining the available results:

耆旧

In this example we can observe that we have a 平板 (heiban) pitch accent pattern. This matches with the result we get:

We can thus confidently use this audio for our card.

TTS fallback with nice-sounding AI voice

Often, we want to mine vocabulary in highly specific areas, such as names of celebrities, which might not have readily available audio but are relevant to us. We use a high quality AI voice TTS fallback for these cases. You can be confident you will ALWAYS get audio. Take these names for example:

坪内逍遥  平兼盛

These names are pronounced perfectly by the TTS engine.

TTS pitch accent awareness

The TTS references known pitch accent patterns for words with multiple pitch accents. You can individually request the pitch accent pattern you want if it's available. For example:

毎日  標示

If no pitch accent pattern is available the TTS will take a best guess.

TTS pitch accent override (NEW)

Mining a word without an entry in the pitch accent database and the TTS guesses the wrong pitch accent? You can now generate every possible pitch accent variation, including vowel variations. Let's look at a few examples:

Theoretically a word read as こう could be read in 4 ways: コウ、コー、コ'ウ、コ'ー

The Ultimate Yomitan Audio Source now lets you generate all of these variations, so you can get the correct pitch and pronunciation no matter how obscure the word is!

Let's look at one more example: 平成 (へいせい). In addition to giving you the TTS with the pitch from the database (平板 in this case), it will also let you generate every possible variation of the word.
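To make the idea of "every variation" concrete, here is a minimal sketch that enumerates candidate spellings and accent positions for a reading. It is not the implementation used by the audio source. The function names, the rule that ウ after an o-column mora and イ after an e-column mora may be lengthened to ー, and the decision to skip the odaka pattern (it is not among the four こう forms listed above) are assumptions made for illustration. The apostrophe marks the pitch drop, as in the こう example.

```python
from itertools import product

SMALL_KANA = set("ャュョァィゥェォヮ")
# Katakana whose vowel is "o" or "e" (assumed triggers for a long-vowel variant).
O_COLUMN = set("オコソトノホモヨロヲゴゾドボポョォ")
E_COLUMN = set("エケセテネヘメレゲゼデベペェ")


def to_katakana(text: str) -> str:
    """Shift hiragana code points into the katakana block; leave the rest alone."""
    return "".join(chr(ord(c) + 0x60) if "ぁ" <= c <= "ゖ" else c for c in text)


def split_morae(kana: str) -> list[str]:
    """Split katakana into morae, attaching small kana to the preceding one."""
    morae: list[str] = []
    for ch in kana:
        if ch in SMALL_KANA and morae:
            morae[-1] += ch
        else:
            morae.append(ch)
    return morae


def vowel_variants(morae: list[str]) -> list[list[str]]:
    """Offer both the written vowel and ー wherever a long vowel is plausible."""
    options = []
    for i, mora in enumerate(morae):
        prev = morae[i - 1][-1] if i > 0 else ""
        if (mora == "ウ" and prev in O_COLUMN) or (mora == "イ" and prev in E_COLUMN):
            options.append((mora, "ー"))
        else:
            options.append((mora,))
    return [list(combo) for combo in product(*options)]


def pitch_variations(reading: str) -> list[str]:
    """List every spelling with every drop position (0 = heiban, no mark)."""
    morae = split_morae(to_katakana(reading))
    out = []
    for variant in vowel_variants(morae):
        for accent in range(len(variant)):
            if accent == 0:
                out.append("".join(variant))
            else:
                out.append("".join(variant[:accent]) + "'" + "".join(variant[accent:]))
    return out


if __name__ == "__main__":
    print(pitch_variations("こう"))
```

For こう this enumeration yields the same four forms as listed above. The number of candidates grows quickly with word length, which is why picking the right one by ear is still up to you.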

Pick your sources (NEW)

Only want human audio and no TTS? No problem. You can now pick and choose what sources you want to include.

No reading - No problem

Some dictionaries have entries without any reading at all. The Ultimate Yomitan Audio Source can handle these entries by making an educated guess at the reading. For example:

一方的に

This exists as an entry in 新和英 without a reading.

📦 Installation

You have two options:

- Run it yourself following the instructions here: https://github.com/friedrich-de/yomitan-ultimate-audio
- Sign up for the $1 tier on my Patreon and get access to the hosted version of the Ultimate Yomitan Audio Source. This is the easiest way to get started, and you don't have to worry about setting anything up.

1. Subscribe on Patreon → Get your personal API key by authenticating here.
2. Open Yomitan settings → Navigate to Audio sources → Add a new audio source with the following details:
   - Type: Custom URL (JSON)
   - URL: The link you were provided with after authentication
3. Test it out:
   - Try looking up a word to confirm audio is working properly
   - Enjoy high-quality audio for all your cards!
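If audio does not play, it can help to look at what the URL returns. As far as I understand Yomitan's Custom URL (JSON) type, it expects a JSON object with `"type": "audioSourceList"` and an `audioSources` array whose items carry a `url` and an optional `name`. The sketch below parses such a response. The sample payload, the `example.invalid` URLs, and the `parse_audio_source_list` helper are made up for illustration and are not output from the real service.

```python
import json

# Made-up sample in the shape Yomitan's "Custom URL (JSON)" type is assumed to expect.
SAMPLE_RESPONSE = """
{
  "type": "audioSourceList",
  "audioSources": [
    {"name": "human", "url": "https://example.invalid/audio/utsu.mp3"},
    {"name": "tts", "url": "https://example.invalid/audio/utsu-tts.mp3"}
  ]
}
"""


def parse_audio_source_list(payload: str) -> list[dict]:
    """Validate the response shape and return its name/url pairs."""
    data = json.loads(payload)
    if not isinstance(data, dict) or data.get("type") != "audioSourceList":
        raise ValueError("response is not an audioSourceList object")
    sources = data.get("audioSources", [])
    # Entries without a URL are skipped because they cannot be played.
    return [
        {"name": s.get("name", ""), "url": s["url"]}
        for s in sources
        if isinstance(s, dict) and "url" in s
    ]


if __name__ == "__main__":
    for source in parse_audio_source_list(SAMPLE_RESPONSE):
        print(source["name"], source["url"])
```

An empty `audioSources` list would mean the request succeeded but nothing matched, which is a different problem from a wrong URL or an expired key.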

→ SIGN UP ON PATREON NOW ←

A note on batch generation

The Ultimate Yomitan Audio Source can be used in batch generation scripts and cleans up input parameters that might break other audio sources. However, if you intend to perform batch generation, it remains your responsibility to ensure the correctness of the input parameters and the resulting audio.

When using the Generate Batch Audio add-on for Anki, set it up as follows:

⚠️ Important:

Ensure you're putting in the correct field values:
- ?term= should match your word field (without any extras like fancy HTML)
- ?reading= should match your reading field (without any extras like fancy HTML)
Put in a delay of 0.2 to avoid getting rate limited.
Warning: As getting high-fidelity AI TTS is expensive, you may run into your API limit when generating a very large number of cards.
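For a custom batch script, the same precautions can be sketched like this: strip HTML from the field values, URL-encode them into the `term` and `reading` parameters, and pause between requests. `BASE_URL`, the note field names `word` and `reading`, and the helper functions are placeholders, and treating the delay of 0.2 as seconds is an assumption.

```python
import html
import re
import time
from urllib.parse import urlencode

# Placeholder; use the link you received after authentication instead.
BASE_URL = "https://example.invalid/audio"

TAG_RE = re.compile(r"<[^>]+>")


def clean_field(value: str) -> str:
    """Drop HTML tags, decode entities, and trim surrounding whitespace."""
    return html.unescape(TAG_RE.sub("", value)).strip()


def build_url(base_url: str, term: str, reading: str) -> str:
    """Append percent-encoded term and reading parameters to the base URL."""
    query = urlencode({"term": clean_field(term), "reading": clean_field(reading)})
    separator = "&" if "?" in base_url else "?"
    return f"{base_url}{separator}{query}"


def batch_urls(notes, delay=0.2, sleep=time.sleep):
    """Yield one request URL per note, pausing between notes (delay assumed to be seconds)."""
    for index, note in enumerate(notes):
        if index:
            sleep(delay)
        yield build_url(BASE_URL, note["word"], note["reading"])


if __name__ == "__main__":
    notes = [
        {"word": "<b>打つ</b>", "reading": "うつ"},
        {"word": "毎日", "reading": "<span>まいにち</span>"},
    ]
    for url in batch_urls(notes):
        print(url)
```

The sketch only builds the URLs. Fetching the audio and checking that each result really matches the card is left to the script that uses them.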

Size Comparison

This is currently the biggest word audio database in existence. Here is an overview:

Audio Source	Entries
Ultimate Yomitan Audio Source	877464 + TTS Fallback
Yomichan Audio Server Entries (Rust Server)	732607 (without Chinese)
Local Audio Yomichan (Anki Add-on)	590410