Follow Slashdot blog updates by subscribing to our blog RSS feed

 



Forgot your password?
typodupeerror
×
Programming IT Technology

Software Archaeology 434

Plug1 writes "Salon (day pass needed) has an article about preserving software for historical purposes. It discusses source code archiving, and the effect the DMCA is having on attempts to catalog and analyze legacy code. It will be a shame if in the future a wealth of information is locked away because knoweldge of the underlying technology is lost."
This discussion has been archived. No new comments can be posted.

Software Archaeology

Comments Filter:
  • by Anonymous Coward on Wednesday July 30, 2003 @11:55AM (#6571166)
    July 30, 2003 | For Grady Booch, the nightmare goes something like this: Deep in the future, a team of archaeologists stumble onto a rare cache of 20th century art, a major assortment of works thought lost to the ravages of time. http://cm.mps.salon.com/mps/desk/nav/salonlogo.gif http://cm.mps.salon.com/mps/desk/nav/salonlogo.gif

    The only problem, of course, is that they don't know it. All the images are recorded in an obsolete digital format, JPEG, and nobody knows how to unscramble the data. As a result, the hard disk containing said artwork spends its days not in a museum but as a coffee coaster in some college professor's crowded office.

    "It might seem silly now, but put yourself 1,000 years in the future," says Booch, chief scientist at IBM's Rational Software subsidiary. "It's not too hard to imagine."

    In an industry where one man's clever C code is another man's Linear B, Booch already knows the frustration of playing software archaeologist. As co-developer of the Universal Modeling Language (UML), a mid-1990s effort to create a common "blueprint" notation for object-oriented software programs, he's spent the last 10 years laboring to spare future programmers the same torment.

    It's an uphill battle on a hill that is only growing steeper. With new programs replacing old and no major company or institution playing the central role of source-code archivist, the amount of software history currently circling the memory hole is scarily large. And even if there were a central institution, recent changes to the copyright code have made the transfer of source code from old media to new forms of storage a dicey prospect, legally. Add it all up, and you have the ideal makings for what some are already calling the "digital dark age."

    "Things are going to be lost not because people don't want to save them or because the original creators don't want to save them, but because they can't save them," says Brewster Kahle, founder of the Internet Archive, an institution that has lobbied for a safe harbor within the Digital Millennium Copyright Act to shield institutions looking to archive source code.

    For Booch, the barriers to software preservation aren't so much legal as educational. Most developers have come to accept the evolvable nature of software programs. What is lacking is the ability to examine static source-code snapshots with a scholarly, comparative eye. In the interest of encouraging that skill, Booch this fall will lead a seminar on software archaeology and preservation at the newly reopened Computer History Museum in Mountain View, Calif.

    "Our industry has had a major effect in changing the world," says Booch, talking over the phone from his Denver, Colo., office. "It would be great if we could preserve the artifacts and interview the architects while they're still alive."

    Booch isn't alone. Now that the hysteria surrounding Y2K has faded, developers are free to worry about legacy code again. One increasingly common worry is what to do with it? For every modern offshoot of DOS/Windows, Unix and Macintosh OS evolving with the marketplace, a dozen ghost programs lurk inside yellowed engineering pads, punch-card stacks and slowly degaussing magnetic memories. Even if programmers could get their hands on these programs and find a way to preserve and update their contents, a new question emerges: How do you qualitatively analyze those contents on a historical basis?

    "It's funny," says Dave Thomas, a Dallas software consultant and co-author, with Andrew Hunt, of "The Pragmatic Programmer," a 1999 book on software design methods. "Colleges spend a lot of time teaching people how to write code, but very few teach them how to read code. When you think about it, we programmers spend most of our time reading code, not writing code."

    To help fill the gap, Thomas served as cohost of the 2001 Software Archaeology: Understanding Large Systems workshop, hosted by Object Oriented Programming,
  • by bigman2003 ( 671309 ) on Wednesday July 30, 2003 @12:11PM (#6571334) Homepage
    DRM IS AN OPTION with Windows Media Player when ripping a CD. It is not mandatory. There is a checkbox at: Tools-->Options--->Copy Music See the 'Copy Protect Music' box? Uncheck it-
  • by Ian Lance Taylor ( 18693 ) <ian@airs.com> on Wednesday July 30, 2003 @12:26PM (#6571464) Homepage
    You can copy very small sections from a book or article, with attribution, because that is fair use. Copying the whole book or article is copyright infringement.

    Copying an entire article from another web site is also copyright infringement, unless of course the copyright terms of the article permit it.

    Salon probably makes some money per page view. They want you to look at their web site, not copy text off of it. Copying an entire article is almost certainly copyright infringement, and makes whoever does it liable for damages.
  • by TheRaven64 ( 641858 ) on Wednesday July 30, 2003 @12:28PM (#6571479) Journal
    Repeat after me:

    TCPA hardware is not the same as DRM, and is not evil

    The TCPA hardware specifies a cryptography co-processor on the mainboard. This can be used for DRM, but it can also be used for offloading things like SSL from the CPU. Emulating the hardware would be no good. Under *NIX, it would just be mounted at /dev/crypto (or something), and emulated if the hardware were not availible. It is the software which manages DRM.

  • Re:Nestalgia (Score:1, Informative)

    by Anonymous Coward on Wednesday July 30, 2003 @12:38PM (#6571567)
    Thinking that version n+1 is always better than version n or even n-20 is the typical thinking of a Microsoft lemming.
  • by Anonymous Coward on Wednesday July 30, 2003 @12:41PM (#6571601)
    or rather, it doesn't count towards kharma, so if you modded up with funny, then modded down as overrated, you lose kharma.

    You could lose 10 points on a funny post.
  • archive.org (Score:3, Informative)

    by dmnic ( 452122 ) on Wednesday July 30, 2003 @12:44PM (#6571628)
    they have a section for software where they are getting old software from the likes of Macromedia and others for preservation. havent seen any source-code listed, but its still a good service for history
  • by Andrew Leonard ( 4372 ) on Wednesday July 30, 2003 @12:46PM (#6571642) Homepage
    I responded to this above once already, but because this is dear to my heart, I'll do it again. Of course Salon isn't going to care if anyone prints out a copy and tapes it to their cube wall. But if a Web site grabs the text and posts it in a place like Slashdot, that deprives us of literally thousands of readers. Many of those readers might otherwise watch and ad and grab the daypass, which is good for our financial health, and some percentage of other readers might even subscribe, which is even better for us.

    Technically, it's copyright infringement, but Salon isn't going to devote resources to suing Slashdot or Slashdot readers. If we were going to go that route, we'd start with the Freerepublic assholes, who actively want us to go bankrupt and do everything they can to help us down that road. To slashdot readers, the best appeal I can make is simple.

    We want to make a living at what we do, so we can keep doing it. I want to keep paying great technology writers like Rachel Chalmers and Sam Williams to do interesting stories. If we convince enough readers to watch our ads or subscribe, we'll pull off this magic trick. So basically, the way I see it, any time a Slashdot reader posts the full text of a story on Slashdot, it's a vote against our survival, which is ironic, since you wouldn't be posting the stories if you didn't think there was some merit in them, right?
  • by stratjakt ( 596332 ) on Wednesday July 30, 2003 @12:50PM (#6571681) Journal
    Actually, that's not quite what's happening.

    ISDA spiders are trolling around and seeing a ftp/web site with "video game" in the text and offering files like pacman.zip and streetfighter2.zip for download.

    C&D notices are automatically being sent, none of it has to do with the DMCA, but with regular old copyright law, since the ISDA assumes the games are being put up for download.

    Whatshisface (who had the big manual site and shut it down) just couldnt be bothered to explain to anyone at the ISDA what files are.

    I dont think any manufacturer really gives a shit about people collecting/trading/photocopying the service and operation manuals, or even schematics for out of production machines.

  • Re:Heh... (Score:2, Informative)

    by ocelotbob ( 173602 ) <ocelot@@@ocelotbob...org> on Wednesday July 30, 2003 @01:02PM (#6571800) Homepage
    Depends on what you need. Windows, Linux, etc, are all sometimes too big for an application, or otherwise just get in your way. DOS is useful if you just need a small set of system calls, but otherwise, want nothing. Yeah, clocks are cheap these days, but at the same time, a wasted clock is still a wasted clock. I've got a feeling that there are still going to be a few DOS-based apps developed for the next few decades; DOS is just too entrenched to think otherwise.
  • by shoppa ( 464619 ) on Wednesday July 30, 2003 @01:17PM (#6571924)
    I know it's far easier to complain about the situation rather than do something about it. But there are groups doing something about it:
    1. The PDP-11 Unix Preservation Society [tuhs.org]
    2. The PDP-10 software archive [trailing-edge.com]
    3. SIMH Simulators for classic hardware [trailing-edge.com]
  • by JCCyC ( 179760 ) on Wednesday July 30, 2003 @01:41PM (#6572125) Journal
    Hi, Mr. Leonard, and let me first thank you all at Salon for making a great general-interest site for geeks and non-geeks alike.

    The Day Pass is a great idea, but some day or other I notice it plain doesn't work. Today, I tried to go to the article mentioned here, only to be redirected again and again to the same partial-content page. The Sprint ad never appears. Under Win 2000. Bot from IE and from Mozilla 1.4. I'd guess a technical problem on your (Ultramercial's?) side.

    In this circumstances, I'd consider the posting of the entire article forgivable (although the poster didn't state Day Pass problems as the reason, which puts his/her motives in question). Otherwise, I agree it's a rather uncivilized behavior.
  • by Andrew Leonard ( 4372 ) on Wednesday July 30, 2003 @02:45PM (#6572743) Homepage
    I am informed that earlier today the daypass option was broken. My apologies.
  • Re:Eyeglasses (Score:1, Informative)

    by Anonymous Coward on Wednesday July 30, 2003 @03:23PM (#6573168)
    the very concept of IP is just plain evil

    What you are talking about, however, is not IP; it was a trade secret.

    With modern computers, keeping something key (such as how SSL encryption is done) a trade secret is nay-to-impossible. From 1987 to 1994, RC4 was a trade secret; you had to buy RC4 from RSA labs to get the RC4 algorithm.

    This all changed in 1994; someone spilled the beans on the the cyberpunks mailing list (anonymously), and RC4 was no longer a trade secret.

    Because RSA labs opted to keep RC4 a trade secret instead of opting for IP protection, once the secret was out, there was nothing RSA labs could have done to protect the trade secret.

    If, however, they opted for IP protection, RC4 would be under a patent, and never would have been secret. The idea behind patents is that one gets 20 years of protection in exchange for not keeping their invention a secret. Of course, like anything else legal, this gets abused by greedy scum (such as Forgent and whats-his-face who delayed his patent filings).

    There is a difference between a copyright, a patent, and a trade secret; it would do the Slashdot readership quite some good to know these differences.
  • by David Leppik ( 158017 ) on Wednesday July 30, 2003 @04:14PM (#6573638) Homepage
    Although the issues involved in this case are slightly different, The term 'Software Archaeology' (or at least 'Programmer Archeologist') might come from Vernor Vinge's book 'A Deepness in the Sky'.

    In that book, code-as-data is taken to an extreme, and the best programmers have the title "Programmer Archaeologist", since they spend little time writing new code; instead they look through old code to find something written for a similar situation. It isn't that old programmers are better-- it's that the software contains facts and information that are of value.

    Whereas on Star Trek someone might look through an ancient captain's log to learn about a bizarre planet/new race/weird disease/strange technology, in Vinge's book that sort of specialized information is stored in the source code for software that was written at the time to deal with the situation.
  • by ncc74656 ( 45571 ) <scott@alfter.us> on Wednesday July 30, 2003 @04:48PM (#6573907) Homepage Journal
    Handguns are made for killing people. All opinions aside, that's a basic fact. Handguns are a tool designed for killing, and specifically for killing people. Arguing that they have some other practical use is just silly.

    Before continuing to demonstrate your ignorance on this subject, you might wish to visit this site [nra.org] and enlighten yourself. At the very least, you might consider at least not automatically taking what these [bradycampaign.org] maroons [vpc.org] say as gospel. This [amazon.com] is also highly-recommended reading.

    It's just a suggestion...take it or leave it, but I'd rather not engage in a battle of wits with an unarmed man. It's too much like shooting fish in a barrel...it quickly gets boring.

  • by buffer-overflowed ( 588867 ) on Wednesday July 30, 2003 @05:06PM (#6574094) Journal
    Don't own a gun do ya?

    Actually, target shooting guns are designed to accurately put a tiny hole in a piece of paper at varying distances. This includes some handguns. Yup, they're designed for sports shooting, wow, imagine that. These are generally low caliber, now you can kill a person with a .22 but that's not their purpose (mainly hunting).

    Handguns are designed for varying purposes, but mainly that purpose is to put some form of projectile into the air at something over 1,000 feet per second. This projectile can be a bullet (lethal, or perhaps just for stopping power [like shooting someone in the arm or leg to knock him down]) or a non-lethal rubber bullet. For instance law-enforcement in this country often uses .45s (big bullets). These guns have excellent stopping power, meaning if I shoot you in a non-lethal manner, you're going to get knocked the hell around and probably not be a threat anymore. Compared to a 7.62 x 39mm round which has excellent penetration or a .223 which has excellent penetration and range (M16's use em [.223 that is]). (But those are rifle rounds, you said handguns right?) A firearm is a weapon, so is a sword, whether or not it is used to kill is completely up to the intent of the wielder. It can also injure, scare, protect.

    In fact, paper suffers more than just about anything else in this country from firearm ownership.

    Won't somebody think of the trees?
  • Dark Ages II (Score:3, Informative)

    by jeremycec ( 639648 ) on Wednesday July 30, 2003 @05:07PM (#6574100)
    Brian Bergeron gives a fairly decent treatment of the whole data loss issue in his book Dark Ages II: When the Digital Data Die [barnesandnoble.com] . Although, this could be a lot of hysteria over nothing. As I recall in Asimov's Foundation's Edge [kodu.net], Trevize comes across some ancient computers, and they just fire up and start working beautifully right away after centuries of disuse. Heheh, if only this were the case. The hard drive on the HP I got last Christmas already crapped out.

A morsel of genuine history is a thing so rare as to be always valuable. -- Thomas Jefferson

Working...