<![CDATA[UTAU.SEXY - Blog]]>Tue, 06 Oct 2020 16:47:31 -0700Weebly<![CDATA[How to Use & Set-Up Presamp/AutoCVVC 2.0 for UTAU]]>Fri, 01 May 2020 07:00:00 GMThttp://utau.sexy/blog/presamp_autocvvc-setup-useMakku & Violin
Hello! Just a quick disclaimer: this guide was written using both Makku's and Violin's banks as a base, and there's no guarantee that it will work perfectly with every single bank. It should work with most banks, barring weird aliasing systems or errors.

If you would like a video version of this tutorial, please see Here

Downloads

Both of these tools, along with the base for the Japanese voicebank's reclist, were made by Delta. Go check out their blog; they make tons of cool material for UTAU!
Extra features (check UST in extra folder): (For Reese JPN)
  • Glottal stops (triggered by ・ or . in romaji);
  • Vocal fries (triggered by ');
  • Breaths (吸, 吸vowel, brvowel, brnumber, br, breath);
  • Consonant sounds (k, ky, g, gy, t, ty, d, dy, ch, ts, b, by, p, py, r, ry, sh, s);
  • L and V addons (ヴぁ, ラ, etc...)
  • Falsetto (triggered by F);
  • Romaji.
Extra features: (For AERIS ​CV-VC Japanese Natural 2.0)
  • Glottal stops (triggered by ・ or . in romaji);
  • Ending Breaths (triggered by inhale/exhale/exhale-inhale);
  • Ending V C- (triggered by C or C- followed by an R);
  • Consonant to Consonant support (f, k, p, s, sh, ts);
  • T/th, D/dh, L, English R, Z/zh, rr (rolled Rs), and V addons (ヴぁ, ラ, etc...)
  • Different ん sounds (See ReadMe for more info)
  • Romaji.
Extra features (check UST in extra folder): (For Reese ENG)
  • Glottal stops (triggered by .);
  • Vocal fries (triggered by ');
  • Presamp/autoCVVC compatibility;
  • Endbreaths (triggered by h).

How to use Presamp:

​Download presamp from the link provided above. Open "presamp08996.zip" and put the files "wavtoolex.exe" and "presamp.exe" in UTAU's installation folder. Open the folder "hook4presamp20140614" and extract the "dummy" folder inside it in UTAU's plugin folder (you can find it by opening UTAU, going to "Tools(T)", then "Plug-Ins(N)" and then clicking on "Open Plug-Ins Folder(O)"). Open "predit1730.zip" and extract the "predit" folder in the same plugin folder.
​IMPORTANT: If you haven't already, please make sure your Locale is set to Japanese, that your time format is also set to Japanese and that the decimal symbol is a . (dot) as opposed to , (comma). Apparently only the first step is necessary for American systems, but if you're having issues try all of them.
​Go to "Tools(T)", then "Option(O)..." and change these options:
- Rendering| Turn off all options except for the first one (first one is optional);
- Cache| Turn on "Cache intermediate files". (Remove cache files at quit is recommended)
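If you want to double-check the file placement described above, here is a minimal Python sketch. It only looks for the two executables and the two plugin folders named in this guide; the function name and the two folder arguments are hypothetical, and you'd pass in wherever UTAU and its plugin folder actually live on your system.

```python
from pathlib import Path

# Names taken from the install steps in this guide.
REQUIRED_FILES = ("presamp.exe", "wavtoolex.exe")   # go in UTAU's installation folder
REQUIRED_PLUGIN_DIRS = ("dummy", "predit")          # go in UTAU's plugin folder


def missing_presamp_items(utau_dir, plugin_dir):
    """Return the names of expected files/folders that are not in place.

    utau_dir and plugin_dir are assumptions supplied by the caller;
    this sketch does not try to locate UTAU by itself.
    """
    utau_dir, plugin_dir = Path(utau_dir), Path(plugin_dir)
    missing = [n for n in REQUIRED_FILES if not (utau_dir / n).is_file()]
    missing += [n for n in REQUIRED_PLUGIN_DIRS if not (plugin_dir / n).is_dir()]
    return missing
```

An empty list means everything this guide asks for was found; anything else is the list of items still to copy.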
That should be it! Open your UST (both CV and VCV will work, but VCV sounds less smooth on my end, so I recommend you convert the UST to CV format; it doesn't matter if it's romaji or hiragana), choose the voicebank, set wavtool and resampler BOTH to "presamp.exe" and make sure to go to the Plugins and open the dummy plugin! If presamp is glitching out, you can use the dummy plugin each time to fix it. You can use the predit plugin to change which wavtool and resampler you'd like to use. Additionally, if presamp keeps crashing, try going into the predit plugin and setting the number of bats to a lower number.
Additional notes for the English banks:
  • The voicebank (Reese ENG, found here on Reese's page) has full support for presamp! To use it simply type the phonetics you wish (list below) for each syllable.
    e.g.: This is Reese = [DIs] [Iz] [ris]
  • Ending consonants/vowels are made automatically by presamp or autoCVVC so no need to worry about those.
  • If you don't wanna use presamp, we recommend you type in the phonetics, then use autoCVVC on each sentence separately to fine tune as needed. It's also important to go slow when using autoCVVC as it has a tendency to mess up timing. 
  • IMPORTANT NOTE: [l] is only applicable to words/syllables that start with l, for everything else PLEASE use [5]
Known issues:
  • Consonant clusters are hit or miss, I suggest adding a separate note for them (issue only present in LITE reclist).
    Recommended solution: cry = [kr] [aI] or [kr] [raI]. [kraI] may also work, but it often glitches out.
  • Vowel extensions aren't functional for some reason we haven't figured out. If you wanna extend a vowel you may encounter some issues.
    Recommended solution: using !(vowel). The [!] makes it so that presamp stops being "automatic" for that specific note.
  • Number errors are caused when a specific phonetic combo can also resemble a voicebank's pitch.
    Recommended solution: try avoiding using combos like E4 or A5, for example by putting numbers in the next note.
    e.g.: better = [bE4] [3] this is likely to glitch, so try [bE] [43]
  • Lack of P0. Presamp has a feature that allows the flag "P0" to be used for specific notes (mostly clusters) to avoid volume contrast. However through our testing this feature wasn't working. This isn't a major issue, but it's worth noting.
    Recommended solution: ignore the issue or use P0 globally (this may make mixing harder, but will probably result in better quality overall)
    IMPORTANT: not all resamplers support the P0 flag. P0 cancels UTAU's normalization, so that the volume is equal to the original recording's. For example, tn_fnds does not support this flag.
  • Instability. This method mostly works fine, but it may be unstable on certain notes, especially with regard to consonants.
    Recommended solution: split the consonants manually. This may be especially needed on faster songs.
    e.g.: picture = [pI] [k] [tS] [3]
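For the number errors listed above, a quick way to spot risky lyrics before rendering is to scan for anything that looks like a pitch name. This is only a rough sketch: the pattern (a letter A to G, an optional #, then a digit) is our assumption about what a pitch suffix looks like, and the function name is made up for illustration.

```python
import re

# Assumed shape of a pitch name: note letter, optional sharp, one digit (E4, A5, C#5).
PITCH_LIKE = re.compile(r"[A-G]#?\d")


def pitch_like_spans(lyric):
    """Return every substring of the lyric that resembles a pitch name."""
    return PITCH_LIKE.findall(lyric)


# The "better" example from the list above:
print(pitch_like_spans("bE4"))  # the risky spelling
print(pitch_like_spans("bE"))   # the safer split, first note
print(pitch_like_spans("43"))   # the safer split, second note
```

Any lyric that returns a non-empty list is a candidate for moving the number into the next note.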

How to use AutoCVVC 2.0:

​Download autoCVVC from the link provided above. Open "autoCVVC2.002.zip" and extract the "autoCVVC2.002" folder in UTAU's plugin folder (refer to presamp section for where to find it). Open UTAU and select the notes you'd like to convert to CVVC, then open the autoCVVC plugin.
  1. Change the language from "Ja" to "En" (bottom left).
  2. Make sure these options are selected (these may change depending on the bank):
  • Optimize;
  • Use [-CV];
  • Use ending note;
  • Allow replace;
  • Allow split;
  • Use presamp (optional, better if used in combination with Presamp... if that's your sort of thing...);
  • Adjust param (optional, but recommended. Though this can be quite buggy and if you experience glitches it is recommended that you clear the UST and not use this option/the option below for the remainder of the UST);
  • Crossfade (optional, but recommended. Though this can be quite buggy and if you experience glitches it is recommended that you clear the UST and not use this option/the option above for the remainder of the UST).
Click submit and you're done!
Additional notes for the English banks:
It is recommended that you use Presamp for the English VBs, but if you want to use AutoCVVC, we recommend going very slow and checking constantly for errors. Also, it should be noted that we have not done extensive testing with AutoCVVC, as our main goal was Presamp support. It may perform differently from Presamp, but thus far our testing has shown that it is very similar.
]]>
<![CDATA[Converting VCCV English USTs for use with AERIS CV-VC ENG Divine (Kire 2.0)]]>Wed, 01 Jan 2020 08:00:00 GMThttp://utau.sexy/blog/converting-vccv-english-usts-for-use-with-aeris-cv-vc-eng-divine-kire-20Violin


Hello! Violin here and I'm going to try to explain to you how to quickly convert a UST from standard VCCV English to work correctly and fully with AERIS CV-VC English Divine (or Kire 2.0). This list uses a lot of elements of VCCV and if you're not familiar with VCCV, I recommend that you watch Cz's tutorials on it, link here

Also, this guide might be a bit complex, as it assumes that you already have some knowledge of UTAU and, more specifically, UTAU English and VCCV English.

Alright, without further ado, let's start. I also just want to mention that the UST that I'm going to be using for this is the same one that Cz uses in their tutorials. (Link here) Also, I won't be going over tuning, mostly because I'm just Bad at it. 

Alright so here's the first section of the UST! Now as you can see there's some ! which we can fix. For reference, the lyrics for this portion of the UST are "So there it goes again". 

Now for starters we can replace the [dhA] with [dhAr] because ar/Ar/Er/0r are full vowels! That means they have -CV/V C/CV/VC- as well as -V/V/V- and some limited VV; I'll touch on these in a min.
Second, with regards to the Ar- ri portion, there are 2 ways to do it. I recommend that for any and all VVs with ar/Ar/Er/0r you do VC V, as in the original VCCV. So [Ar-][ri] would be replaced with [Ar i].
But what about that 2nd way to do it? Good question, now I want to say that I do not recommend using this method over using the VC V but it is still here. We're going to replace [Ar-] [ri] with [Ar *] [* ri]. This chops up the [Ar i] from a VC V into more of a CVVC format. 
However, I find it a bit less smooth and more work so in these cases, I just stick with the VC V.
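Here's the same choice written out as a tiny Python sketch, in case it helps to see both replacements side by side. The function name and its arguments are made up for illustration, and the "*" is copied literally from the aliases above; this is not how any UTAU plugin actually does it.

```python
def convert_r_vowel_transition(r_vowel, consonant, vowel, style="VC V"):
    """Rewrite an [r_vowel-] [consonant+vowel] pair, e.g. [Ar-] [ri].

    style="VC V" gives the recommended single alias, e.g. [Ar i].
    style="CVVC" gives the two-alias alternative, e.g. [Ar *] [* ri].
    """
    if style == "VC V":
        return [f"{r_vowel} {vowel}"]
    if style == "CVVC":
        return [f"{r_vowel} *", f"* {consonant}{vowel}"]
    raise ValueError(f"unknown style: {style!r}")


print(convert_r_vowel_transition("Ar", "r", "i"))                # ['Ar i']
print(convert_r_vowel_transition("Ar", "r", "i", style="CVVC"))  # ['Ar *', '* ri']
```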
Now, with regards to the [it-], this bank has glottal stops! I would use them in these cases, but it's up to personal preference really. The glottal stops are accessible via ['] and the ending glottal stops (which I would use in this instance) are accessible via [V'-].
For the rest of this part of the UST, the "goes again", it remains the same.
Now we move on to another part of the UST! For this part the lyrics are "​I've lost it, now it's time to pretend"
Right off the bat, you can see there's a few spots we have to adjust here too. 

Let's start with an important one, the [Iv]. Unlike standard VCCV English, this bank does not have any VCs (aliases like [Iv]/[9s]/etc.). It DOES have both V C (aliases such as [x g] and [O z]) and VC- (ending consonant aliases such as [en-]/[Oz-]/etc.), and it also has full C Cs! So, we're going to replace the [Iv] with [Iv-] and then add in a [v l].
Also, an important note: when using C Cs, it's important to either use the flag [P0], turn down the volume of that specific note to around 60/40, or use a combination of both.
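The [Iv] replacement above follows a simple pattern: the VC becomes an ending VC-, and a C C note bridges into the next consonant. Here's a minimal Python sketch of that pattern. The function name and arguments are hypothetical, and you have to tell it where the vowel ends and the consonant begins, since it doesn't know the phoneme set.

```python
def split_vc_before_consonant(vowel, consonant, next_consonant):
    """Turn a VC alias into [VC-] plus a [C C] transition.

    Example from this guide: vowel "I", consonant "v", next consonant "l"
    gives [Iv-] followed by [v l].
    """
    ending = f"{vowel}{consonant}-"                  # ending consonant alias, e.g. Iv-
    transition = f"{consonant} {next_consonant}"     # C C alias, e.g. v l
    return [ending, transition]


print(split_vc_before_consonant("I", "v", "l"))  # ['Iv-', 'v l']
```

The C C note it produces is the one that needs P0 or a lowered volume, as noted above.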
We're going to continue with this trend of using C C for the following [9s] section too, replacing it with [9s][s t]. Also something to note here is that you could also replace it with [9 s][sti] as st is a CCV and is recorded. I'm also going to replace the [it-] with a glottal stop, as shown before. 
Continuing on with the next part, "now it's time to pretend" there's a lot of work to be done here too. 
For starters, we're going to replace the [ts] with [ts-], move the [it-] note back a bit, and also replace the [tI] with [stI]. 
Now with the next section, the [Im], we have a few choices. Remember when I said that there aren't any VCs? Well, there are actually a few, but only for m, n, and ng. So for [Im] we can do [Im] or [Im-] [m t]. I'm going to do [Im] because it's a very short transition.
Next, we're going to convert the [pr][_re] into a CCV, as this bank has full CCV support! 
We're going to leave the rest alone as there's not much else we can do here. 
On to the next part! For this part, the lyrics are "that I'm still not a lost cause"
Now, for starters we're going to change the [Im] into [Im-] [m s], as this is a longer transition. Please note that, unlike standard VCCV English, [m st] is not in this list and [m s] must be substituted for it. This goes for all C CC.
Next, we're going to change [il] into [il n], as there is full support for Vl C!
For the next option, you could replace [ddu] [u l] [l9] with [ddll] [ll 9] with ll being the held l if you so desire. I'm going to keep it as is though. 
I'm also going to add a C C in place of [st], so it would be [l9] [9s-] [st-] [t k]. Remember again that for C C it's important to use P0 on the note, turn down the volume for that specific note, or use a combination of both.
On to the next section! 
The lyrics for this part are "That I know I have those flaws". 
There's not much to touch here: change the [@v] into [@v-] [v dh], change [Os] into [Os-] [s f], and change the CC and _CV into a full CCV.
The lyrics for this next part are "I can't see clearly when my eyes are open"
]]>