Follow TV Tropes

Following

Converting all data to UTF8 (now complete)

Go To

Hey tropers, we are about to undergo a large update to the tvtropes' database to convert the encoding. During this time the login system will be offline. You'll still be able to read TVTropes but you won't be able to post or edit anything during the conversion.

How long will the login system be down?

My most recent test run took 9 hours (my first test took 20+ hours so this is after a lot of optimizations to get down to 9 hours). ***it will now take 11 hours. see update below

Can you give me more details?

TVTropes was originally hosted on Windows' servers back in the day and all content on the site is encoded with Windows 1252 (a superset of ISO-8859-1, aka latin1).

According to the W3Tech, only 1.3% of internet traffic is ISO-8859-1. I'm guessing a large amount of that is TVTropes considering we have millions of pages defined with that charset.

The majority of the internet is encoded in UTF8. By doing this conversion it will bring us up to modern standards with the rest of the web and allow us to more easily support other languages and icons. It will also allow us to use more modern tools to help with editing such adding a WYSIWYG editor option. It will also help with code development as we often have to add special workarounds to continue to support this long deprecated charset.

Are we changing anything else?

While we have the login system offline, we are going to upgrade the edits history database table to include a sequence number. We've always wanted to do this but it could not be done while the site was online. That table has 40M+ rows of data. With this change we'll be able to make it so we can easily jump to any page when filtering to edits from a specific page or from a specific user. We have this for the forum and that's why you can jump to page 500 for example of a long running thread and not have a long delay.

When will this happen?

UPDATE: (take two) This is now scheduled for Mon Dec 5th at 8:30PM PST until Tue Dec 6th at 7:30AM

At 8:30PM PST you will be logged-off of TVTropes so make sure to save anything before that time. Once the conversion is done the next morning, I'll point the code to the new server and everyone should be auto logged in. If not, when you see the all clear announcement at the top of the page, go ahead and try logging in again.

The process will take approximately 11 hours. I had it down to 9 hours until I found examples of utf8 encoded values inside latin1 columns so I had to add some extra testing to ensure that data doesn't get double encoded in the process.

If you have any issues during the migration please send an email to thestaff@tvtropes.org

UPDATE: (complete) The migration is officially complete. It ended up taking nearly 12 hours but we got there. All data on TVTropes (hundreds of millions of rows of data) has now been converted from Latin1 to UTF8. We now have the same encoding as 99% of other websites and can support special characters and other languages. It will also allow us to build other tools such as a WYSIWYG editor.

Edited by itcdr on Dec 6th 2023 at 8:48:40 AM

MyFinalEdits Officially intimidated from Parts Unknown (Ten years in the joint) Relationship Status: TV Tropes ruined my love life
Officially intimidated
#51: Dec 3rd 2023 at 8:38:42 PM

Got it. I'll make sure not to do any major edits tomorrow just in case. I had planned to begin editing an adopted TLP draft, but I'll probably wait until this process is complete.

135 - 161 - 273 - 191 - 188 - 230 - 300
Lapsem Since: Aug, 2020
#52: Dec 3rd 2023 at 11:25:42 PM

Will current formatting for non-standard characters still work? (i.e.: &e acute;)

Check out my Pantheon Search Database prototype!
Ayumi-chan low-poly Shinri from Calvard (Apprentice) Relationship Status: Serial head-patter
low-poly Shinri
#53: Dec 4th 2023 at 3:18:18 AM

Bumping my question from the last page.

She/Her | Currently cleaning Char Clone
JHD0919 One-Track Mind (he/him) from a 12-pack of Diet Coke (Troper in training) Relationship Status: Abstaining
One-Track Mind (he/him)
#54: Dec 4th 2023 at 4:41:48 AM

[up]I mean...it's just logging in, right? I doubt there would be any bugs with the login system of all things.

This is Idol Tap. (My Troper Wall)
HarmonyBunny2000 Gamer Bunny Gal from my workplace (AKA my bedroom) (Troper in training) Relationship Status: He makes me feel like I have a heart
Gamer Bunny Gal
#55: Dec 4th 2023 at 5:13:46 AM

I'm curious if we'll get this month's Trope Report before or after the conversion.

skewview Since: Jun, 2013
#56: Dec 4th 2023 at 6:40:31 AM

[up] I'm hoping after, particularly because I'd wish for a late addition... well, late as in a few days ago.

Time-frame for conversion.

AFK with issues, will return
AudioSpeaks2 He/Him (Greenhorn) Relationship Status: Oh my word! I'm gay!
He/Him
#57: Dec 4th 2023 at 6:58:43 AM

[up] I think it should go after.

Art Museum Curator and frequent helper of the Web Original deprecation project
WarJay77 Bonnie's Artistic Cousin from The Void (Troper Knight) Relationship Status: Armed with the Power of Love
Bonnie's Artistic Cousin
#58: Dec 4th 2023 at 8:01:05 AM

The conversation won't change the grammar checking, though. It'll still have to happen and we don't like to add last minute work.

Current Project: Incorruptible Pure Pureness
itcdr Since: Aug, 2014
#59: Dec 4th 2023 at 9:07:22 AM

Ok. Everything is a go. I'll post a site wide announcement shortly. I'll start the conversion tonight at 8:30pm PST.

Here is how it will work:

- I'll disable logging in at 8:30pm PST tonight (I won't delete any cookies, just set the login function to return false)

- I'll wait 10 minutes, make sure no more inserts are happening then take a snapshot

- A complete copy of the database at that time will be copied to a new identical server

- From there I'll start converting all data to UTF8.

- When it's finished tomorrow morning, I'll switch the code to point to the new database server and everyone should instantly be logged-in (finger's crossed on the auto login part)

@MyFinalEdits, anything you do today will be saved as long you submit the changes before 8:30pm PST.

@Ayumi-chan, don't worry. I don't think there will be any issues with the login. All cookie data will be copied over so it should auto log you in when we switch to the new server. If that doesn't happen just re-login. And if there is any issue with your password just use the "forgot password" link on the login modal.

Edited by itcdr on Dec 4th 2023 at 9:08:35 AM

Lapsem Since: Aug, 2020
Malady (Not-So-Newbie)
#61: Dec 4th 2023 at 10:21:17 AM

Rarely used feature made years ago.

Disambig Needed: Help with those issues! tvtropes.org/pmwiki/posts.php?discussion=13324299140A37493800&page=24#comment-576
DeMarquis Who Am I? from Hell, USA Since: Feb, 2010 Relationship Status: Buried in snow, waiting for spring
Who Am I?
#62: Dec 4th 2023 at 10:24:59 AM

Will this have any effect on searching forum threads—hopefully improve it)?

"We learn from history that we do not learn from history."
Lymantria Tyrannoraptoran Reptiliomorph from Toronto Since: Apr, 2015 Relationship Status: Historians will say we were good friends.
Tyrannoraptoran Reptiliomorph
#63: Dec 4th 2023 at 10:26:22 AM

In the database update banner that now appears on every page on the site, someone misspelled “approximately”.

https://static.tvtropes.org/pmwiki/pub/images/a930db4e_9c81_43ca_a867_3d5239a336f2.jpeg
Oops.

[down][down] Thanks.

Edited by Lymantria on Dec 4th 2023 at 6:33:42 PM

Join the Five-Man Band cleanup project!
Lymantria Tyrannoraptoran Reptiliomorph from Toronto Since: Apr, 2015 Relationship Status: Historians will say we were good friends.
Tyrannoraptoran Reptiliomorph
#64: Dec 4th 2023 at 10:26:46 AM

[ignore]

Edited by Lymantria on Dec 4th 2023 at 6:27:01 PM

Join the Five-Man Band cleanup project!
itcdr Since: Aug, 2014
BonsaiForest a collection of small trees from the woods (4 Score & 7 Years Ago) Relationship Status: Tongue-tied
a collection of small trees
#66: Dec 4th 2023 at 11:52:46 AM

I've wondered before what was up with the weird formatting in articles where special characters are used. So that's the explanation!

So now I wonder, if this site isn't built on Unicode, how it'll handle emojis. Lemme try a cat emoji: 🐈

(EDIT: Huh, it rendered correctly. I guess the real test would be if it were inside a page itself, how the markup would look.)

(EDIT 2: Okay, it saved the cat emoji in the markup as AMPERSAND HASH 128008; in a test I ran.)

Edited by BonsaiForest on Dec 4th 2023 at 2:56:22 PM

I'm up for joining Discord servers! PM me if you know any good ones!
TargetmasterJoe Since: May, 2013
#67: Dec 4th 2023 at 11:53:27 AM

Thanks for the heads-up. Gives me a reason to go to sleep sooner. [lol]

punkcrow Tobias/TJ (He/Him) from Northwest Indiana Since: Dec, 2020 Relationship Status: Showing feelings of an almost human nature
Tobias/TJ (He/Him)
#68: Dec 4th 2023 at 12:17:16 PM

Good to know! I'd like to make it easier to type in special characters, admittedly.

Cold turkey's getting stale. Tonight I'm eating crow.
AgProv Ag Prov from Northern England Since: Jul, 2011 Relationship Status: With my statistically significant other
#69: Dec 4th 2023 at 12:18:34 PM

So this kicks in at 04:30 am GMT tomorrow morning. Got it.

Male, early sixties, Cranky old fart, at least two decades behind. So you have been warned. Functionally illiterate in several languages.
Risa123 Since: Dec, 2021 Relationship Status: Above such petty unnecessities
#70: Dec 4th 2023 at 12:20:08 PM

As someone who had to deal with encoding issues on this site, I'm happy that something is being done about it.

Fighteer Lost in Space from The Time Vortex (Time Abyss) Relationship Status: TV Tropes ruined my love life
Lost in Space
#71: Dec 4th 2023 at 1:00:34 PM

This will not change how wicks are constructed, and article titles/namespaces still have to be in plain ASCII.

"It's Occam's Shuriken! If the answer is elusive, never rule out ninjas!"
Malady (Not-So-Newbie)
#72: Dec 4th 2023 at 1:21:16 PM

Will it allow easier cutlisting / destruction of bad named work / ghost pages with non-ASCII characters titles?

Like that "Super Mario Långben" that's somewhere?

Edited by Malady on Dec 4th 2023 at 1:21:30 AM

Disambig Needed: Help with those issues! tvtropes.org/pmwiki/posts.php?discussion=13324299140A37493800&page=24#comment-576
Fighteer Lost in Space from The Time Vortex (Time Abyss) Relationship Status: TV Tropes ruined my love life
Lost in Space
#73: Dec 4th 2023 at 1:22:51 PM

Good question. I don't think the cutlist form will accept extended characters, but we can give it a shot.

"It's Occam's Shuriken! If the answer is elusive, never rule out ninjas!"
G-Editor Since: Mar, 2015 Relationship Status: Above such petty unnecessities
#74: Dec 4th 2023 at 1:36:31 PM

Hey so will this data conversion affect troopers who are suspended from certain forums? I'm only asking since I'm currently suspended from Long Terms Project

[down] I was thinking that if membership goes offline resulting in data conversion, it could effect those that got suspended in certain forums. If not then that is fine. I do know where to go to appeal

Edited by G-Editor on Dec 3rd 2023 at 11:56:03 PM

Fighteer Lost in Space from The Time Vortex (Time Abyss) Relationship Status: TV Tropes ruined my love life
Lost in Space
#75: Dec 4th 2023 at 1:41:11 PM

[up] I don't know why you would think that. No, it will have no effect on suspensions. If you want to appeal, use the Edit Banned thread.

"It's Occam's Shuriken! If the answer is elusive, never rule out ninjas!"

Total posts: 331
Top