Unicode UTF-8 to WAV Converter

AnthroHeart
Administrator

Repeater, Holo-Stones & Octave Tech Creator

Posts: 3,500

Unicode UTF-8 to WAV Converter Apr 2, 2024 13:53:24 GMT

Quote

Post by AnthroHeart on Apr 2, 2024 13:53:24 GMT

I created a UTF-8 to WAV Converter, so you can transform your emojis and foreign characters into a WAV file.

Code: github.com/tsweet77/wav-converter/raw/main/Unicode_to_WAV_Repeater_Smoothing.cpp

Binary: github.com/tsweet77/wav-converter/raw/main/Unicode_to_WAV_Repeater_Smoothing.exe

Here's what one document sounds like with Chinese lettering:

UTF-8.txt_432Hz_SmoothingPercent_0_48000.mp3 (458.83 KB)

Intention Repeater MAX Bundle: intention-repeater.sourceforge.io/

User Guide: sourceforge.net/projects/intention-repeater/files/Intention_Repeater_MAX_Bundle_User_Guide.pdf/download

Android App: android.intentionrepeater.com/

Bundle Video: www.youtube.com/watch?v=9w8u3_ucD90

AnthroHeart
Administrator

Repeater, Holo-Stones & Octave Tech Creator

Posts: 3,500

Unicode UTF-8 to WAV Converter Apr 2, 2024 15:19:26 GMT

Quote

Post by AnthroHeart on Apr 2, 2024 15:19:26 GMT

The highest printable UTF-8 character on my computer is: 􏿿

It might not print correctly here.

I tested it alongside some normal text, and it didn't seem to throw off the values that much.

But I'll see if I can work with 32-bit UTF-8 characters.

Intention Repeater MAX Bundle: intention-repeater.sourceforge.io/

User Guide: sourceforge.net/projects/intention-repeater/files/Intention_Repeater_MAX_Bundle_User_Guide.pdf/download

Android App: android.intentionrepeater.com/

Bundle Video: www.youtube.com/watch?v=9w8u3_ucD90

AnthroHeart
Administrator

Repeater, Holo-Stones & Octave Tech Creator

Posts: 3,500

Unicode UTF-8 to WAV Converter Apr 2, 2024 15:27:47 GMT

Quote

Post by AnthroHeart on Apr 2, 2024 15:27:47 GMT

The highest reasonable UTF-8 character is: 😂, U+1F602, which is well supported across platforms.

It did add peaks to my otherwise normal UTF-8 text.

When I used a chinese document, it didn't seem to affect the WAV that much.

Last Edit: Apr 2, 2024 15:33:30 GMT by AnthroHeart

Intention Repeater MAX Bundle: intention-repeater.sourceforge.io/

User Guide: sourceforge.net/projects/intention-repeater/files/Intention_Repeater_MAX_Bundle_User_Guide.pdf/download

Android App: android.intentionrepeater.com/

Bundle Video: www.youtube.com/watch?v=9w8u3_ucD90

reden
Global Moderator

Posts: 2,589

Unicode UTF-8 to WAV Converter Apr 2, 2024 15:29:00 GMT

Quote

Post by reden on Apr 2, 2024 15:29:00 GMT

Apr 2, 2024 15:19:26 GMT AnthroHeart said:

The highest printable UTF-8 character on my computer is: 􏿿

It might not print correctly here.

I tested it alongside some normal text, and it didn't seem to throw off the values that much.

But I'll see if I can work with 32-bit UTF-8 characters.

That is Undefined Character (U+10FFFF), part of PUA (Private Use Area, F0000-10FFFF), a place where any company or group may define any character to mean whatever they wish. For example, Apple has the apple logo you can write from a Mac or iphone there.

AnthroHeart Administrator Repeater, Holo-Stones & Octave Tech Creator Posts: 3,500	Unicode UTF-8 to WAV Converter Apr 2, 2024 15:40:12 GMT Quote Select Post Deselect Post Link to Post Member Give Gift Back to Top Post by AnthroHeart on Apr 2, 2024 15:40:12 GMT Should I define the highest code my Unicode to WAV can read to be 😂, U+1F602 (The most practical). or: 􏿿 (U+10FFFF)? The latter might require using 32-bit values.
	Last Edit: Apr 2, 2024 15:42:31 GMT by AnthroHeart Intention Repeater MAX Bundle: intention-repeater.sourceforge.io/ User Guide: sourceforge.net/projects/intention-repeater/files/Intention_Repeater_MAX_Bundle_User_Guide.pdf/download Android App: android.intentionrepeater.com/ Bundle Video: www.youtube.com/watch?v=9w8u3_ucD90

reden
Global Moderator

Posts: 2,589

Unicode UTF-8 to WAV Converter Apr 2, 2024 16:07:56 GMT

Quote

Post by reden on Apr 2, 2024 16:07:56 GMT

Apr 2, 2024 15:40:12 GMT AnthroHeart said:

Should I define the highest code my Unicode to WAV can read to be 😂, U+1F602 (The most practical).

or: 􏿿 (U+10FFFF)?

The latter might require using 32-bit values.

The final meaningful printable character, before arriving to the Supplementary and Tertiary Ideographical (Chinese, Japanese, Korean) characters, which are tens of thousands of those, is 🯹 U+1F602. (which looks like a calculator font 9)
🯹 belongs to Symbols for Legacy Computing and is farther away from 😂 as seen in en.wikibooks.org/wiki/Unicode/Character_reference/1F000-1FFFF .

AnthroHeart
Administrator

Repeater, Holo-Stones & Octave Tech Creator

Posts: 3,500

Unicode UTF-8 to WAV Converter Apr 2, 2024 16:09:39 GMT

Quote

Post by AnthroHeart on Apr 2, 2024 16:09:39 GMT

Apr 2, 2024 16:07:56 GMT reden said:

Apr 2, 2024 15:40:12 GMT AnthroHeart said:

Should I define the highest code my Unicode to WAV can read to be 😂, U+1F602 (The most practical).

or: 􏿿 (U+10FFFF)?

The latter might require using 32-bit values.

The final meaningful printable character, before arriving to the Supplementary and Tertiary Ideographical (Chinese, Japanese, Korean) characters, which are tens of thousands of those, is 🯹 U+1F602. (which looks like a calculator font 9)
🯹 belongs to Symbols for Legacy Computing and is farther away from 😂 as seen in en.wikibooks.org/wiki/Unicode/Character_reference/1F000-1FFFF .

I think I want to allow Chinese and foreign characters.

Intention Repeater MAX Bundle: intention-repeater.sourceforge.io/

User Guide: sourceforge.net/projects/intention-repeater/files/Intention_Repeater_MAX_Bundle_User_Guide.pdf/download

Android App: android.intentionrepeater.com/

Bundle Video: www.youtube.com/watch?v=9w8u3_ucD90

reden
Global Moderator

Posts: 2,589

Unicode UTF-8 to WAV Converter Apr 2, 2024 16:23:42 GMT

Quote

Post by reden on Apr 2, 2024 16:23:42 GMT

Apr 2, 2024 16:09:39 GMT AnthroHeart said:

Apr 2, 2024 16:07:56 GMT reden said:

The final meaningful printable character, before arriving to the Supplementary and Tertiary Ideographical (Chinese, Japanese, Korean) characters, which are tens of thousands of those, is 🯹 U+1F602. (which looks like a calculator font 9)
🯹 belongs to Symbols for Legacy Computing and is farther away from 😂 as seen in en.wikibooks.org/wiki/Unicode/Character_reference/1F000-1FFFF .

I think I want to allow Chinese and foreign characters.

By tens of thousands of them, I meant even more than there already are in the Common Planes. There are several Codepages fully dedicated to recording thousands of Chinese and Korean characters in the Basic Multilingual Plane, the most common and the first in the list.

AnthroHeart
Administrator

Repeater, Holo-Stones & Octave Tech Creator

Posts: 3,500

Unicode UTF-8 to WAV Converter Apr 2, 2024 16:31:32 GMT

Quote

Post by AnthroHeart on Apr 2, 2024 16:31:32 GMT

Apr 2, 2024 13:53:24 GMT AnthroHeart said:

I created a UTF-8 to WAV Converter, so you can transform your emojis and foreign characters into a WAV file.

Code: github.com/tsweet77/wav-converter/raw/main/Unicode_to_WAV_Repeater_Smoothing.cpp

Binary: github.com/tsweet77/wav-converter/raw/main/Unicode_to_WAV_Repeater_Smoothing.exe

I updated to allow for chinese and foreign langauge, and also allow for emojis.

GPT4 tells me this is the highest emoji value: 🧿U+1F9FF

I tested it with "🧿I am Love." and it did produce pulses but they weren't that bad. I used 0% smoothing.

I'd rather not have to use logarithmic interpolation or anything. The audio isn't that bad.

However if I use "🧿IIIIIIIIII" the audio goes flat.

Last Edit: Apr 2, 2024 16:37:21 GMT by AnthroHeart

Intention Repeater MAX Bundle: intention-repeater.sourceforge.io/

User Guide: sourceforge.net/projects/intention-repeater/files/Intention_Repeater_MAX_Bundle_User_Guide.pdf/download

Android App: android.intentionrepeater.com/

Bundle Video: www.youtube.com/watch?v=9w8u3_ucD90

AnthroHeart
Administrator

Repeater, Holo-Stones & Octave Tech Creator

Posts: 3,500

Unicode UTF-8 to WAV Converter Apr 2, 2024 16:46:45 GMT

Quote

Post by AnthroHeart on Apr 2, 2024 16:46:45 GMT

Ok, that makes sense that a bunch of the same letters "IIII" will produce silence.

When I did "🧿ababababababab" it did quiet down the audio to less than 100% max because of the outlier.

I may have to do statistical analysis. Or consult with Claude when I am able to access it again because of usage limits.

That one reduces the volume of the WAV by -4dB approximately. It isn't bad, but it's not 100%.

Last Edit: Apr 2, 2024 16:48:29 GMT by AnthroHeart

Intention Repeater MAX Bundle: intention-repeater.sourceforge.io/

User Guide: sourceforge.net/projects/intention-repeater/files/Intention_Repeater_MAX_Bundle_User_Guide.pdf/download

Android App: android.intentionrepeater.com/

Bundle Video: www.youtube.com/watch?v=9w8u3_ucD90

reden
Global Moderator

Posts: 2,589

Unicode UTF-8 to WAV Converter Apr 2, 2024 17:42:48 GMT

Quote

Post by reden on Apr 2, 2024 17:42:48 GMT

Apr 2, 2024 16:46:45 GMT AnthroHeart said:

Ok, that makes sense that a bunch of the same letters "IIII" will produce silence.

When I did "🧿ababababababab" it did quiet down the audio to less than 100% max because of the outlier.

I may have to do statistical analysis. Or consult with Claude when I am able to access it again because of usage limits.

That one reduces the volume of the WAV by -4dB approximately. It isn't bad, but it's not 100%.

-4dB is rather little, it doesn't really matter. I've heard that for speakers, hardware speaking, it's important to keep the volume at 95-99% instead of 100 as 100 could (unlikely, rarely) blow the speaker out.

reden
Global Moderator

Posts: 2,589

Unicode UTF-8 to WAV Converter Apr 2, 2024 17:48:54 GMT

Quote

Post by reden on Apr 2, 2024 17:48:54 GMT

Apr 2, 2024 16:31:32 GMT AnthroHeart said:

Apr 2, 2024 13:53:24 GMT AnthroHeart said:

I created a UTF-8 to WAV Converter, so you can transform your emojis and foreign characters into a WAV file.

Code: github.com/tsweet77/wav-converter/raw/main/Unicode_to_WAV_Repeater_Smoothing.cpp

Binary: github.com/tsweet77/wav-converter/raw/main/Unicode_to_WAV_Repeater_Smoothing.exe

I updated to allow for chinese and foreign langauge, and also allow for emojis.

GPT4 tells me this is the highest emoji value: 🧿U+1F9FF

I tested it with "🧿I am Love." and it did produce pulses but they weren't that bad. I used 0% smoothing.

I'd rather not have to use logarithmic interpolation or anything. The audio isn't that bad.

However if I use "🧿IIIIIIIIII" the audio goes flat.

The highest emoji is 🫸 1FAF8, Rightwards Pushing Hand

AnthroHeart
Administrator

Repeater, Holo-Stones & Octave Tech Creator

Posts: 3,500

Unicode UTF-8 to WAV Converter Apr 2, 2024 18:11:36 GMT

Quote

Post by AnthroHeart on Apr 2, 2024 18:11:36 GMT

I tried to take out outliers, but it would reduce the audio volume when using smoothing.
So I gave up. If your input has a bunch of outliers (unicode characters of way higher value than the rest of the text, it will reduce the overall volume. But it's not bad. It's like -4dB or something.

Intention Repeater MAX Bundle: intention-repeater.sourceforge.io/

User Guide: sourceforge.net/projects/intention-repeater/files/Intention_Repeater_MAX_Bundle_User_Guide.pdf/download

Android App: android.intentionrepeater.com/

Bundle Video: www.youtube.com/watch?v=9w8u3_ucD90

nathanmyersc
Senior Member

Posts: 291

Unicode UTF-8 to WAV Converter Apr 2, 2024 23:36:19 GMT

Quote

Post by nathanmyersc on Apr 2, 2024 23:36:19 GMT

Apr 2, 2024 18:11:36 GMT AnthroHeart said:

I tried to take out outliers, but it would reduce the audio volume when using smoothing.
So I gave up. If your input has a bunch of outliers (unicode characters of way higher value than the rest of the text, it will reduce the overall volume. But it's not bad. It's like -4dB or something.

Hmm somehow collect outliers after youve valued all the characters and convert them to unique values within the range you like

like compare all the highest valued you have. first get all the character values then create an set which is unique values. then find the highest value ones and see how far they are from the average. if they are really far then clamp them to unique values within the range you like. i still cannot get the audios made by your exe file to work

for the image writer not sure about the unicode writer atm. hope it works id like to make some sanskrit affrmatons.

Last Edit: Apr 2, 2024 23:37:47 GMT by nathanmyersc

AnthroHeart
Administrator

Repeater, Holo-Stones & Octave Tech Creator

Posts: 3,500

Unicode UTF-8 to WAV Converter Apr 2, 2024 23:40:13 GMT

Quote

Post by AnthroHeart on Apr 2, 2024 23:40:13 GMT

Apr 2, 2024 23:36:19 GMT nathanmyersc said:

Apr 2, 2024 18:11:36 GMT AnthroHeart said:

I tried to take out outliers, but it would reduce the audio volume when using smoothing.
So I gave up. If your input has a bunch of outliers (unicode characters of way higher value than the rest of the text, it will reduce the overall volume. But it's not bad. It's like -4dB or something.

Hmm somehow collect outliers after youve valued all the characters and convert them to unique values within the range you like

like compare all the highest valued you have. first get all the character values then create an set which is unique values. then find the highest value ones and see how far they are from the average. if they are really far then clamp them to unique values within the range you like. i still cannot get the audios made by your exe file to work

for the image writer not sure about the unicode writer atm. hope it works id like to make some sanskrit affrmatons.

Yes, I tried 1.5 standard deviations from the mean of all the text character values. It did take out the outliers, but it also reduced the amplitude when I used smoothing.

That method could also potentially take out characters that aren't too far off from the mean, so I dropped it.

Last Edit: Apr 2, 2024 23:40:47 GMT by AnthroHeart

Intention Repeater MAX Bundle: intention-repeater.sourceforge.io/

User Guide: sourceforge.net/projects/intention-repeater/files/Intention_Repeater_MAX_Bundle_User_Guide.pdf/download

Android App: android.intentionrepeater.com/

Bundle Video: www.youtube.com/watch?v=9w8u3_ucD90