Hi, thanks for expressing your concern about this!
Unfortunately, you are correct about the degredation in quality. The software we use (Voicepeak) has some technical limitations with its accuracy for reading things due to its inability to pronounce kanji with the accurate reading 100% of the time. Due to this, we had a lot of weird mistakes in the original batch of audio where something like ひと would be read as じん or にん and a lot of similar things.
In order to get around this, we tried programmatically removing the kanji from everything so that the correct kana would be read 100% of the time, but this resulted in what we have now where the audio can’t seem to tell where one word ends and another begins, creating very strange pronunciations. In our small scale test we didn’t have this issue, but obviously with the massive bulk input, there turned out to be many cases that confused the software. In retrospect, we should have held off on implementing this until we tested that everything was correct, and apologize for the inconvenience!
For now, we are looking at rolling back to the old audio files, as there were far fewer mistakes. Additionally, we have also started looking into some other alternatives such as 音読さん and Microsoft Azure for creating audio for sentences, as they are both far far more reliable in terms of getting the readings of kanji correct, and are the current top picks for lifelike TTS.