How Facebook Stores Billions of Photos 154
David Gobaud writes "Jason Sobel, the manager of infrastructure engineering at Facebook, gave an interesting presentation titled Needle in a Haystack:
Efficient Storage of Billions of Photos at Stanford for the Stanford ACM. Jason explains how Facebook efficiently stores ~6.5 billion images, in 4 or 5 sizes each, totaling ~30 billion files, and a total of 540 TB and serving 475,000 images per second at peak. The presentation is now online here in the form of a Flowgram."
Re:I dunno. (Score:3, Informative)
To view the slideshow . . err I mean 'flowgram' (whatever the fuck that's supposed to mean), you dont need to register.
Ahh haha (Score:2, Informative)
Very interesting (Score:3, Informative)
Re:I dunno. (Score:4, Informative)
Re:FLASH?! (Score:2, Informative)
Re:FLASH?! (Score:3, Informative)
Worked for me from Ubuntu.
540TB / 30 billion images (Score:4, Informative)
RS
Re:540TB / 30 billion images (Score:5, Informative)
Already been done. (Score:5, Informative)
This stuff is cool either way, even if it is just "childish spam." Many of us only dream to work on something that will become this large scale.
...Fortune 500 companies could probably learn a thing or two...
Re:Akamai? (Score:2, Informative)
I dont work for FB, but a company that does make use of 3rd party caching networks for very large content distributions
Tm
server load fixed (Score:2, Informative)
Re:Full sized images, please (Score:3, Informative)
It's not ideal, but it works quite well. A friend of mine is a professional photographer and she puts all her work up there. Works well for her.
User-mode GoogleFS (Score:5, Informative)
(summarizing the big long presentation)
This is basically want to make a usermode GoogleFS. Their biggest problem is reducing reads - which are hampered by Posix file standards (inodes, metadata, etc...)
Instead they use a database-like index/data file arrangement. The index stays in memory and files are stored together in large contiguous spaces on a single file. It's possible to utilize a LUN for storage - but not there yet.
There... where's my cookie?
(Oddly enough - I'm writing the exact same code they are... bazaar world, eh??)
Re:Akamai? (Score:3, Informative)
That won't work considering the number of files. Given the quote (which require nearly a year of hassle with the Akamai morons and sighing an NDA, thus the AC post) we got from those idiots, it would cost us almost $200k/year given our bandwidth use to store ~1,000 files. Facebook has 30 billion files and assuming the same price per file as we were quoted, Akamai would charge $6,000,000,000,000/year to host them. To put that number in perspective, that's more than the GDP of the Germany plus that of the UK. The Akamai VP (something Danzig, I remember the name because of the band of the same name) I talked to just wasn't able to comprehend why we wouldn't consider paying that much to host a few files. We ended-up renting five 1U servers in four different countries for about $15k/year. While we have a little less total bandwidth and it requires more management time to maintain, it's only 7.5% of the cost of Akamai and we can store many(1,000 times?) more files than Akamai would let us at that price point. To say that the Akamai guys don't understand math is an understatement.