Slashdot Log In
Zvents Releases Open Source Cluster Database Based on Google
Posted by
ScuttleMonkey
on Fri Feb 08, 2008 06:19 PM
from the surprised-it-took-this-long dept.
from the surprised-it-took-this-long dept.
An anonymous reader writes "Local search engine company, Zvents, has released an open source distributed data storage system based on Google's released design specs. 'The new software, Hypertable, is designed to scale to 1000 nodes, all commodity PCs [...] The Google database design on which Hypertable is based, Bigtable, attracted a lot of developer buzz and a "Best Paper" award from the USENIX Association for "Bigtable: A Distributed Storage System for Structured Data" a 2006 publication from nine Google researchers including Fay Chang, Jeffrey Dean, and Sanjay Ghemawat. Google's Bigtable uses the company's in-house Google File System for storage.'"
Related Stories
Submission: Zvents releases open source cluster database by Anonymous Coward
This discussion has been archived.
No new comments can be posted.
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
Full
Abbreviated
Hidden
Loading... please wait.
Looks promising (Score:1)
Re: (Score:1)
Re: (Score:1)
Re: (Score:1)
Re: (Score:2)
Kitten Nipples (Score:2, Insightful)
Re: (Score:1)
As for scaling, it would scale at the same rate as Non-Commodity Computers... if you have 999 computers all of equal performance, and then you add another one, you could expect a 0.1% change over-all...however its largely based on what sort of controllers you use, the same as h
Re: (Score:2)
Re: (Score:3, Informative)
Hadoop keeps all of its file system metadata in memory on a machine called the name node. This includes information about block placement and which files are allocated which blocks. Therefore, the big crunch we've seen is th
Re: (Score:1)
Re: (Score:2, Insightful)
You mean, like really, this time (Score:2)
Alright, I know it's only storage and not processing power, but that was inevitable.
Re: (Score:3, Informative)
Really, this time, a full fucking beowulf cluster (that runs linux!) is available to /.ers. No. Fucking. Way.
What?
Re: (Score:2)
Links for anyone interested (Score:1, Redundant)
Zvents: http://www.zvents.com/ [zvents.com]
how useful is DHT? (Score:4, Insightful)
Re:how useful is DHT? (Score:5, Interesting)
Parent
Re:how useful is DHT? (Score:4, Insightful)
In the 7 years I've been working in the industry, I've never delivered a single project that I would trust to a non-ACID database. Ever. And I doubt I ever will. If you want something that will generate some marketing material at high speed, and if it fails, who cares, well, use MySQL. If you want to do something that can handle a million pithy comments and if some of them get lost in the shuffle, who cares, well, that's fine too. Use whatever serves fast. If you're running Google, and it doesn't matter if a node drops out because there is no "right" answer to get wrong in the first place as long as you spit out a bunch of links, well, these sorts of non-resilient systems are fine.
Personally, I've never done projects like that. In my projects, if the data isn't perfect always and forever, it's worse than if it had never been written. It's very existence is a liability, because people will rely on it when they shouldn't, for things that can't get by with "close".
So yes. Transactional consistency and a solid relational model are pretty much mandatory, and not going anywhere soon. The idea that they might be replaced by technology such as this is laughable.
Parent
Re: (Score:3, Informative)
Relational databases don't implement the relational model correctly anyway. As for transactional consistency, you can get that on top of many different kinds of stores (including file systems); relational databases have no monopoly on that.
Re: (Score:1, Offtopic)
already, we have the Dick-Shrub using such databases to terrorize the populace with expansion planned.
Re: (Score:2)
Re: (Score:2)
Re: (Score:1)
Re: (Score:2)
Re: (Score:2)
I kind of like dehydrating/serializing objects to a simpler representation when persisting them. This uncomfortable step is nice because it shoehorns the data into a brand new instance.
But that may be just me.
Pairing with RDBMS (Score:1)
Column Orientated DBMS (Score:5, Informative)
IE:
a,b,c,d,e; 1,2,3,4,5,6; a,b,c,d,e;
instead of:
a, 1, a;
b, 2, b;
c, 3, c;
d, 4, d;
e, 5, e;
A cube using the time dimension would look like:
01:01:01; a,b,c,d,e; 1,2,3,4,5; a,b,c,d,e;
01:01:02; a,b,c,d,e; 1,2,6,4,5; a,b,c,d,e;
It's pretty difficult to do the same thing with row-based DBMS. However, you can see that doing an insert is going to be costly.. This looks like a pretty good try, I know there were some other projects going to try to replicate what BigTable does. And after hearing that IBM story the other day about one computer running the entire internet, I started thinking about Google.
More interesting is their distributed file system, which is what makes this really work well.
Re: (Score:3, Informative)
http://download.oracle.com/docs/cd/B19306_01/server.102/b14223/dimen.htm#i1006266 [oracle.com]
Distributed filesystem - Oracle RAC (Real Application Clusters) fits the bill.
Re: (Score:2)
No. (Score:2)
Oracle Dimensions are a logical overlay, they have no impact on how the data is physically organized in segments.
Neither does Oracle RAC -- it uses the same underlying storage format as regular Oracle.
You *could* do column-orientation in Oracle with a data cartridge, but that would likely be third party.
I could see Oracle offering this natively in a future release, maybe 11g r2...
I don't think so... (Score:3, Funny)
Re: (Score:2)
Re: (Score:2)
Re: (Score:2)
Re: (Score:2)
You must be new here.
Re: (Score:1)
Re: (Score:2)
Re: (Score:1)
The correct word is "Asiantated".
Re: (Score:2)
Yay! (Score:1)
Re: (Score:1)
Re: (Score:1)
"Please wait while the Index is updated"
"Please wait while we Upload new entries"
"Please wait for the FBI to knock on your door"
Google 'Forms' (Score:3, Informative)
Don't forget HBase (Score:1)
Wheel: reinvented (Score:3, Insightful)
Yawn.
Re: (Score:2)
Re: (Score:3, Informative)
Pretty much every database on earth has key sorted ranges. Please be less of a noob. Go look up ondex_match_object.
No, it doesn't. It offers explicit locking, because it's been proven for decades that without it, you cannot have hard realtime queries, something that mnesia wanted to offer. You don't have to use tha
Re: (Score:2)
I had thought Bluetail was bought many years ago and absorbed into Nortel
Hbase -- Apache's BigTable (Score:2)
Re: (Score:2)