Follow TV Tropes

Following

TV Tropes was offline due to server issues! Editing has been restored.

Go To

As many of you reported TV Tropes went offline at 2:40am EST on July 7th. It was offline for over 14 hours. The worst outage in many years. We did receive email and text alerts when it happened but unfortunately it was a major hardware failure which took quite a while to get under control.

The cause was a total failure of our database cluster. 6 of our 8 hard drives failed simultaneously resulting in a complete loss of data. We had to have our server company replace the cluster and then we had to rebuild the site from database backups. We do automatic backups every morning. Unfortunately the failure happened hours before the next backup so 24 hours of changes were lost.

To make it worse the history of wiki changes is only updated twice a week because it is over a 1TB in size. We are working on restoring that now so the history tab is blank on all pages until it's done. Editing will be offline for another 24 hours until we get that fixed. And it means we'll lose 72 hours of wiki history due to the timing of the last backup.

We will be working on optimizing our database structure so we can increase the frequency of our database backups to protect the data in the future.

We have redundant web servers on a load balancer, redundant database servers in a cluster and redundant hard drives in every server. So how did this happen? According to our server company there was a manufacturers bug in the firmware of the specific model that 6 of our 8 hard drives were on. That bug caused the disks to die after a certain number of hours running. We don't yet have all the details. They are reaching out to the manufacturer to get more information. I'll update here as I learn more.


UPDATE: (July 8th)

Editing is now enabled! History should be restored as of July 4th 10am EST. The history database imported faster than I expected (4 hours to decompress 1.1TB sql file, 12.5 hours to import)

The only thing I haven't done yet is purge the CDN cache. You must logout to view a page cache. Logged-in users get the live site. Not all pages are still cached and they do expire.

I'll hold off on purging the cache for a few more hours. If there is some specific edit you remember doing during those 24 hours that were lost you may be able to find it by logging out and viewing the cached page. Then login and make that edit again.

Edited by itcdr on Jul 8th 2020 at 7:12:45 AM

ACW Unofficial Wiki Curator for Complete Monster from Arlington, VA (near Washington, D.C.) Since: Jul, 2009
Ster Since: Apr, 2010
#277: Jul 9th 2020 at 2:45:34 PM

So how did this happen? According to our server company there was a manufacturers bug in the firmware of the specific model that 6 of our 8 hard drives were on.

I work for a company that makes storage appliances. I know all about drive firmware bugs. It's a damned miracle any of this stuff works at all... <sigh>

Data_Ninja_X Since: Apr, 2018 Relationship Status: Maxing my social links
#278: Jul 9th 2020 at 9:49:23 PM

Oh man, 6 out of 8 drives failed simultaneously, what?! I gotta wonder if those drives had never been run for that long before and then failing after a certain number of hours is a new issue or not.

nm3youtube Since: Jul, 2012
#279: Jul 10th 2020 at 12:18:47 AM

[up] It was bugged firmware that caused them to brick out of nowhere after (possibly) 40,000 hours. It's not that new an issue. If it was the brand I suspect it was and not a new model turning out to be affected, the server company should have updated the firmware.

Edited by nm3youtube on Jul 10th 2020 at 8:21:27 PM

josedanielsteller Since: Jun, 2017 Relationship Status: watch?v=dQw4w9WgXcQ
#280: Jul 12th 2020 at 2:51:52 AM

I know this disaster was almost a week ago, but GEEZ. If it weren't for those automatic server backups, this would've been the end of TV Tropes.

ImperialMajestyXO Since: Nov, 2015
#281: Jul 12th 2020 at 3:56:39 PM

If you don't mind my asking, what plans do we have going forward?

Piterpicher Veteran Editor IV from Poland, for real (Series 2) Relationship Status: Armed with the Power of Love
Veteran Editor IV
#282: Jul 13th 2020 at 7:31:01 AM

As stated above "We will be working on optimizing our database structure so we can increase the frequency of our database backups to protect the data in the future." (I don't know how they'll optimise it as it seems like mostly text and what the frequency would be, but we'll see). They'll also likely replace those bugged hard drives (not stated, but seems like common sense).

Edited by Piterpicher on Jul 13th 2020 at 4:32:19 PM

Currently mostly inactive. An incremental game I tested: https://galaxy.click/play/176 (Gods of Incremental)
HeraldAlberich from Ohio (Before Recorded History) Relationship Status: Gonna take a lot to drag me away from you
#283: Jul 13th 2020 at 1:16:45 PM

[up] The hosting company already replaced them under warranty.

Fighteer Lost in Space from The Time Vortex (Time Abyss) Relationship Status: TV Tropes ruined my love life
Lost in Space
#284: Jul 13th 2020 at 1:17:41 PM

Under, "It's your goddamn responsibility that these failed so you'd better fix it sharpish," warranty or no. tongue

Edited by Fighteer on Jul 13th 2020 at 4:17:50 AM

"It's Occam's Shuriken! If the answer is elusive, never rule out ninjas!"
GeneralGigan817 Since: Mar, 2020 Relationship Status: Non-Canon
#285: Jul 17th 2020 at 2:39:11 PM

TV Tropes crashed again.

WarJay77 Big Catch, Sparkle Edition (Troper Knight)
Big Catch, Sparkle Edition
#286: Jul 17th 2020 at 2:40:08 PM

I think it was a server thing. A lot of sites went down temporarily, at least for me and my boyfriend.

Currently Working On: Incorruptible Pure Pureness
Ultimatum Disasturbator from Second Star to the left (Old as dirt) Relationship Status: Wishfully thinking
Disasturbator
#287: Jul 17th 2020 at 2:44:31 PM

Internet wide yeah,

Something is happening on the world wide web

New theme music also a box
Fighteer Lost in Space from The Time Vortex (Time Abyss) Relationship Status: TV Tropes ruined my love life
Crossover-Enthusiast from an abaondoned mall (Lucky 7) Relationship Status: Chocolate!
#289: Jul 17th 2020 at 2:50:05 PM

Agh, I'm so thrown off by the borked timezones I keep thinking these posts were made hours ago.

Jawbreakers on sale for 99ยข
WarJay77 Big Catch, Sparkle Edition (Troper Knight)
Big Catch, Sparkle Edition
#290: Jul 17th 2020 at 2:51:24 PM

[up] Same...it's always jarring to re-adjust after just waking up.

But weird, yeah, at least it wasn't just us this time.

Currently Working On: Incorruptible Pure Pureness
mightymewtron Angry babby from New New York Since: Oct, 2012 Relationship Status: THIS CONCEPT OF 'WUV' CONFUSES AND INFURIATES US!
Angry babby
#291: Jul 17th 2020 at 2:52:12 PM

I was scared we'd lose another 24 hours, but since it was an external server issue I guess it didn't hurt the site's data.

I do some cleanup and then I enjoy shows you probably think are cringe.
DanteVin The Time Has Come from Somewhere Since: Jul, 2016 Relationship Status: Singularity
WarJay77 Big Catch, Sparkle Edition (Troper Knight)
Big Catch, Sparkle Edition
#293: Jul 17th 2020 at 4:45:07 PM

Nah, it was just cloudflare crashing.

Currently Working On: Incorruptible Pure Pureness
Piterpicher Veteran Editor IV from Poland, for real (Series 2) Relationship Status: Armed with the Power of Love
Veteran Editor IV
#294: Jul 17th 2020 at 10:44:11 PM

I've slept through it, but yeah, services did crash.

Currently mostly inactive. An incremental game I tested: https://galaxy.click/play/176 (Gods of Incremental)
SeptimusHeap from Switzerland (Edited uphill both ways) Relationship Status: Mu
#295: Jul 21st 2020 at 4:25:27 AM

Seems like the issue is now resolved, so I've taken down the headline and will close this thread as soon as I am done typing this post.

"For a successful technology, reality must take precedence over public relations, for Nature cannot be fooled." - Richard Feynman
Add Post

Total posts: 295
Top