Debugging 290

Posted by timothy on Tuesday February 24, 2004 @03:41PM from the unlousy dept.

dwheeler writes "It's not often you find a classic, but I think I've found a new classic for software and computer hardware developers. It's David J. Agan's Debugging: The 9 Indispensable Rules for Finding Even the Most Elusive Software and Hardware Problems." Read on for the rest.

Debugging: The 9 Indispensable Rules for Finding Even the Most Elusive Software and Hardware Problems
author	David J. Agans
pages	192
publisher	Amacom
rating	9
reviewer	David A. Wheeler
ISBN	0814471684
summary	A classic book on debugging principles

Debugging explains the fundamentals of finding and fixing bugs (once a bug has been detected), rather than any particular technology. It's best for developers who are novices or who are only moderately experienced, but even old pros will find helpful reminders of things they know they should do but forget in the rush of the moment. This book will help you fix those inevitable bugs, particularly if you're not a pro at debugging. It's hard to bottle experience; this book does a good job. This is a book I expect to find useful many, many, years from now.

The entire book revolves around the "nine rules." After the typical introduction and list of the rules, there's one chapter for each rule. Each of these chapters describes the rule, explains why it's a rule, and includes several "sub-rules" that explain how to apply the rule. Most importantly, there are lots of "war stories" that are both fun to read and good illustrations of how to put the rule into practice.

Since the whole book revolves around the nine rules, it might help to understand the book by skimming the rules and their sub-rules:

Understand the system: Read the manual, read everything in depth, know the fundamentals, know the road map, understand your tools, and look up the details.
Make it fail: Do it again, start at the beginning, stimulate the failure, don't simulate the failure, find the uncontrolled condition that makes it intermittent, record everything and find the signature of intermittent bugs, don't trust statistics too much, know that "that" can happen, and never throw away a debugging tool.
Quit thinking and look (get data first, don't just do complicated repairs based on guessing): See the failure, see the details, build instrumentation in, add instrumentation on, don't be afraid to dive in, watch out for Heisenberg, and guess only to focus the search.
Divide and conquer: Narrow the search with successive approximation, get the range, determine which side of the bug you're on, use easy-to-spot test patterns, start with the bad, fix the bugs you know about, and fix the noise first.
Change one thing at a time: Isolate the key factor, grab the brass bar with both hands (understand what's wrong before fixing), change one test at a time, compare it with a good one, and determine what you changed since the last time it worked.
Keep an audit trail: Write down what you did in what order and what happened as a result, understand that any detail could be the important one, correlate events, understand that audit trails for design are also good for testing, and write it down!
Check the plug: Question your assumptions, start at the beginning, and test the tool.
Get a fresh view: Ask for fresh insights, tap expertise, listen to the voice of experience, know that help is all around you, don't be proud, report symptoms (not theories), and realize that you don't have to be sure.
If you didn't fix it, it ain't fixed: Check that it's really fixed, check that it's really your fix that fixed it, know that it never just goes away by itself, fix the cause, and fix the process.

This list by itself looks dry, but the detailed explanations and war stories make the entire book come alive. Many of the war stories jump deeply into technical details; some might find the details overwhelming, but I found that they were excellent in helping the principles come alive in a practical way. Many war stories were about obsolete technology, but since the principle is the point that isn't a problem. Not all the war stories are about computing; there's a funny story involving house wiring, for example. But if you don't know anything about computer hardware and software, you won't be able to follow many of the examples.

After detailed explanations of the rules, the rest of the book has a single story showing all the rules in action, a set of "easy exercises for the reader," tips for help desks, and closing remarks.

There are lots of good points here. One that particularly stands out is "quit thinking and look." Too many try to "fix" things based on a guess instead of gathering and observing data to prove or disprove a hypothesis. Another principle that stands out is "if you didn't fix it, it ain't fixed;" there are several vendors I'd like to give that advice to. The whole "stimulate the failure, don't simulate the failure" discussion is not as clearly explained as most of the book, but it's a valid point worth understanding.

I particularly appreciated Agans' discussions on intermittent problems (particularly in "Make it Fail"). Intermittent problems are usually the hardest to deal with, and the author gives straightforward advice on how to deal with them. One odd thing is that although he mentions Heisenberg, he never mentions the term "Heisenbug," a common jargon term in software development (a Heisenbug is a bug that disappears or alters its behavior when one attempts to probe or isolate it). At least a note would've been appropriate.

The back cover includes a number of endorsements, including one from somebody named Rob Malda. But don't worry, the book's good anyway :-).

It's important to note that this is a book on fundamentals, and different than most other books related to debugging. There are many other books on debugging, such as Richard Stallman et al's Debugging with GDB: The GNU Source-Level Debugger. But these other texts usually concentrate primarily on a specific technology and/or on explaining tool commands. A few (like Norman Matloff's guide to faster, less-frustrating debugging ) have a few more general suggestions on debugging, but are nothing like Agans' book. There are many books on testing, like Boris Beizer's Software Testing Techniques, but they tend to emphasize how to create tests to detect bugs, and less on how to fix a bug once it's been detected. Agans' book concentrates on the big picture on debugging; these other books are complementary to it.

Debugging has an accompanying website at debuggingrules.com, where you can find various little extras and links to related information. In particular, the website has an amusing poster of the nine rules you can download and print.

No book's perfect, so here are my gripes and wishes:

The sub-rules are really important for understanding the rules, but there's no "master list" in the book or website that shows all the rules and sub-rules on one page. The end of the chapter about a given rule summarizes the sub-rules for that one rule, but it'd sure be easier to have them all in one place. So, print out the list of sub-rules above after you've read the book.
The book left me wishing for more detailed suggestions about specific common technology. This is probably unfair, since the author is trying to give timeless advice rather than a "how to use tool X" tutorial. But it'd be very useful to give good general advice, specific suggestions, and examples of what approaches to take for common types of tools (like symbolic debuggers, digital logic probes, etc.), specific widely-used tools (like ddd on gdb), and common problems. Even after the specific tools are gone, such advice can help you use later ones. A little of this is hinted at in the "know your tools" section, but I'd like to have seen much more of it. Vendors often crow about what their tools can do, but rarely explain their weaknesses or how to apply them in a broader context.
There's probably a need for another book that takes the same rules, but broadens them to solving arbitrary problems. Frankly, the rules apply to many situations beyond computing, but the war stories are far too technical for the non-computer person to understand.

But as you can tell, I think this is a great book. In some sense, what it says is "obvious," but it's only obvious as all fundamentals are obvious. Many sports teams know the fundamentals, but fail to consistently apply them - and fail because of it. Novices need to learn the fundamentals, and pros need occasional reminders of them; this book is a good way to learn or be reminded of them. Get this book.

If you like this review, feel free to see Wheeler's home page, including his book on developing secure programs and his paper on quantitative analysis of open source software / Free Software. You can purchase Debugging: The 9 Indispensable Rules for Finding Even the Most Elusive Software and Hardware Problems from bn.com. Slashdot welcomes readers' book reviews -- to see your own review here, read the book review guidelines, then visit the submission page.

Debugging

This discussion has been archived. No new comments can be posted.

Search 290 Comments Log In/Create an Account

Comments Filter:

Change one thing at a time (Score:5, Insightful)

by tcopeland ( 32225 ) * writes: <tom@@@thomasleecopeland...com> on Tuesday February 24, 2004 @03:47PM (#8376822) Homepage

> Change one thing at a time: Isolate the
> key factor, grab the brass bar with both
> hands (understand what's wrong before fixing),
> change one test at a time, compare it with a
> good one, and determine what you changed
> since the last time it worked.

This is helpful with unit tests, too. If I find a bug, I want to figure out which unit test should have caught this and why it didn't. Then I can either fix the current tests, or add new ones to catch this.

Either way, if someone reintroduces that particular bug it'll get caught by the unit tests during the next hourly build [ultralog.net].

He forgot regression tests (Score:5, Insightful)

by mark99 ( 459508 ) writes: on Tuesday February 24, 2004 @03:50PM (#8376853) Journal

Regression test suites (if possible) should be maintained so that when bugs get fixed, they stay fixed.

Just my 2 cents.

Good read (Score:5, Insightful)

by GoMMiX ( 748510 ) writes: on Tuesday February 24, 2004 @03:50PM (#8376857)

"
If you didn't fix it, it ain't fixed: Check that it's really fixed, check that it's really your fix that fixed it, know that it never just goes away by itself, fix the cause, and fix the process."

I can think of a WHOLE lot of tech's and admin's who really need to follow number 9 a lot closer.

Especially those Windows admins/techs who think 'restart' is the ultimate fix-all. Though, sadly, I suppose in many cases that's about all you can do with proprietary software. Well, that and beg vendors to fix the problem. (We all know how productive that is....)

but how do you know it's fixed? (Score:5, Insightful)

by sohp ( 22984 ) writes: <snewton AT io DOT com> on Tuesday February 24, 2004 @03:51PM (#8376858) Homepage

Nothing about writing code for a test case that exercises the bug, then rerunning it every time you make a change you think will fix the bug? Seems like a big oversight. Any program of reasonable size is going to require wasting a significant amount of time restarting and re-running to the point of failure, and with every manual check of the result, there's an increasing probability that fallible human will make a mistake.

More programmers need to get Test Infected [sourceforge.net].

Re:Hardware *Debugging*? (Score:5, Insightful)

by Mick Ohrberg ( 744441 ) writes: <mick.ohrberg@g[ ]l.com ['mai' in gap]> on Tuesday February 24, 2004 @03:52PM (#8376889) Homepage Journal
My boss has three standard trouble-shooting questions:
1. Is it plugged in?
2. Are you logged in?
3. Is it spelled right?
Works in 9 cases out of 10.
Negative. (Score:1, Insightful)

by Anonymous Coward writes: on Tuesday February 24, 2004 @03:53PM (#8376904)

Chips have bugs, why do you think there are re-spins? We are talking from a design point here, not a "techie-fix-this-shit" point. Different ballgame.

Time (Score:5, Insightful)

by quarkoid ( 26884 ) writes: on Tuesday February 24, 2004 @03:53PM (#8376906) Homepage

One thing's clear from looking at that list - spend more time on testing your code.

Unfortunately, speaking as an ex-programmer, time is one luxury that PHBs don't afford their minions. A project needs to be completed and knocked out of the door as soon as possible. The less time spent on unnecessary work, the better.

It is also unfortunate that PC users have been brought up expecting to have buggy software in front of them and expecting to have to reboot/reinstall. What motivation is there to produce bug free code when the users will accept buggy code?

Ho well, at least I run my own company now - master of my own wallet - and can concentrate on quality solutions.

Re:Hardware *Debugging*? (Score:5, Insightful)

by pclminion ( 145572 ) writes: on Tuesday February 24, 2004 @03:54PM (#8376922)

I think the term you want is TROUBLESHOOTING.
Troubleshooting is what you do to fix your mom's ethernet card. "Oooh, it's on the bottom PCI slot, has no interrupt line. I'll just move it up one slot..."
Debugging is what you do with an oscilloscope to figure out why a particular circuit design isn't working as anticipated. You don't "troubleshoot" a circuit design. You debug it.
Or, to put it another way, "troubleshooting" is what a tech support monkey does. "Debugging" is what an engineer does.

Re:Hardware *Debugging*? (Score:3, Insightful)

by wondafucka ( 621502 ) writes: on Tuesday February 24, 2004 @03:55PM (#8376926) Homepage Journal

Get off it. I can't think of a single reason why someone can't "debug" hardware or anything else for that matter. The origin of the word comes from a troubleshooting situation anyways. Why should someone be able to debug a relational database but not a relationship?

Re:yuck (Score:2, Insightful)

by kooso ( 699340 ) writes: on Tuesday February 24, 2004 @03:56PM (#8376950)

Not me. It would be interesting to have a rule of thumb for the real economic cost of debugging this way.

We all (except Dijstra [utexas.edu], perhaps) take trade-offs, for a reason. Perhaps that reason is only ignorance, but then we wouldn't get anything done.

Race Conditions? (Score:5, Insightful)

by Speare ( 84249 ) writes: on Tuesday February 24, 2004 @04:03PM (#8377015) Homepage Journal

Make It Fail is pretty hard to do when it comes to race conditions. This has got to be the most frustrating kind of bug. Others are referring to the Heisenbug which comes in a variety of flavors.
Sometimes you don't KNOW when there's multiple threads or processes, or when there are other factors involved.
Have you noticed that a new thread is spawned on behalf of your process when you open a Win32 common file dialog? Have you noticed that MSVC++ likes to initialize your memory to values like 0xCDCDCDCD after operator new, but before the constructor is called? It also overwrites memory with 0xDDDDDDDD after the destructors are called. And that it ONLY does these things when using the DEBUG variant build process? Did you know that .obj and .lib can be incompatible if one expects DEBUG and the other expects non-DEBUG memory management?
Someone on perlmonks.org was just asking about a Heisenbug where just the timing of the debugger threw off his network queries. Add the debugger, it works. Take away the debugger, it fails. I've got a serial-port device which comes with proprietary drivers that seem to have the same sort of race condition.
The top 9 rules mentioned here look great. But you could write a whole book on just debugging common race conditions for the modern multi-threaded soup that passes for operating systems, these days.

Re:Heisenbugs... (Score:5, Insightful)

by WayneConrad ( 312222 ) * writes: <wconrad&yagni,com> on Tuesday February 24, 2004 @04:04PM (#8377029) Homepage

Heisenbugs are almost always caused by buffer overflows.

They are also almost always caused by race conditions, the most insidious of which is thread-safe code that turns out only to be safe on a uniprocessor system.

And don't forget the phase of the moon, or for the truly unlucky, intermittently glitchy hardware.

I really liked the book, but I would have... (Score:5, Insightful)

by mykepredko ( 40154 ) writes: on Tuesday February 24, 2004 @04:05PM (#8377044) Homepage

probably added a step stating that the problem symptoms and causes should be articulated clearly (probably between #3 and #4) before trying to fix anything. I've seen too many engineers/programmers/technicians list symptoms and attack them individually, only to discover that they were related.

On the surface, this flies in the face of "divide and conquer" - but what I'm really saying here is make sure you have the problem bounded before you attack it.

Also, with Step 9, I would have liked to see more emphasis on ensuring that nothing else is affected by the "fix". Making changes to code to fix a problem is often a one step forward and two steps backwards when you don't completely understand the function of the code that was being changed.

All in all, an excellent book in a little understood area.

myke

Missed one: explain it to someone (Score:5, Insightful)

by deanj ( 519759 ) writes: on Tuesday February 24, 2004 @04:06PM (#8377060)

They missed a good one: explain the bug to someone.

If you start explaining the bug to someone, there's a good chance in mid-explanation you'll realize a solution to the problem.

Some school (can't remember which) had a Teddy Bear in their programming consulting office... There was a sign. "Explain it to the bear first, before you talk to a human". Silly as it sounds, people would do it, and a large portion of the time they'd never actually have to consult the staff... by explaining it to the bear, they solved the problem.

Weird, but true.

Missing rule (Score:3, Insightful)

by timdaly ( 539918 ) writes: on Tuesday February 24, 2004 @04:06PM (#8377062)

He missed a rule: Explain the bug to someone else.
The second pair of eyes often finds the problem
even if they don't have a clue what you are talking
about.

Re:Heisenbugs... (Score:5, Insightful)

by kzinti ( 9651 ) writes: on Tuesday February 24, 2004 @04:09PM (#8377107) Homepage Journal

Heisenbugs are almost always caused by buffer overflows.

In my experience, Heisenbugs are almost always caused by stack problems. That's why they go away when you put print statements in the code - because you're causing the usage of the stack to change.

Buffer overflows (to arrays on the stack) are one good way to munge the stack. Returning the address of an input parameter or automatic variable is another way, because these are declared on the stack and cease to exist when the enclosing block exits. Anybody else using such an address is writing into the stack in an undefined manner, and chaos can result!

A missing rule (Score:5, Insightful)

by Tired and Emotional ( 750842 ) writes: on Tuesday February 24, 2004 @04:10PM (#8377124)

One rule he's missed is very important: Before making a measurement (like printing the value of a variable or changing something about the code) work out what answer you expect to see. Note well - do this before you look at the result. When you see something different, either its a symptom of the bug, or a symptom of you not yet understanding the system. Resolving this will either improve your understanding or turn up the problem.

Re:Good read (Score:5, Insightful)

by swb ( 14022 ) writes: on Tuesday February 24, 2004 @04:11PM (#8377126)

No, it's number *5* that EVERYONE needs to remember to follow. I see way too many people (including myself in a hurry) changing more than one thing at a time and then immediately wondering what fixed or why it didn't get fixed.

This is especially important when changing a second variable can actually mask the fix of the change of the first variable or cause a second failure that appears to be the same as the initial failure.

I guess they should have added a rule 10: be patient and systematic. Obvious problems usually have non-obvious solutions, and a thorough examination of the situation is time consuming. Don't take short cuts or you might miss the problem.

An extra rule (Score:5, Insightful)

by MythMoth ( 73648 ) writes: on Tuesday February 24, 2004 @04:17PM (#8377218) Homepage

"Describe the problem to someone else."

This is so effective that it doesn't require the person to whom you're explaining it to pay attention, or even understand. A manager will do ;-) Even when the person to whom you're explaining it is smart, alert, and interested, it's almost never them that fixes the bug.

The process of describing the behaviour of the program as it ought to be versus the behaviour it is exhibiting forces you to step back and consider only the facts. This in turn is often enough to give you an insight into the disconnect between what's really happening and what you know should be happening.

If you catch yourself saying "that's impossible" when debugging some particularly freaky bit of behaviour, it's definitely time to try this.

The input of the other party is so irrelevant in this process that we used to joke about keeping a cardboard cut-out programmer to save wear and tear on the real ones...

Fresh view: visit next lower level of abstraction (Score:4, Insightful)

by Flexagon ( 740643 ) writes: on Tuesday February 24, 2004 @04:27PM (#8377322)

A good list. As part of rule 8, it's often extremely helpful to look at the problem from a different level of abstraction than one normally would (e.g., different than you coded, or that you best understand it). This often exposes false assumptions that may be blocking a proper analysis.

Successful debugging is a lot like any hard science, particularly if you are not, and cannot, become familiar with the entire system first. Your "universe" is the failing system. You develop hypotheses (failure modes and potential fixes) and run experiments (test them). You have solved the problem only if you completely close the loop (your fix worked, it worked in the way you expected, your hypothesis completely explains the circumstances, and peer review concurrs).

A big part of the "art" is cultivating an attitude of how systems are stressed, and how they may fail under those stresses.

Re:Now That It's Written Down (Score:3, Insightful)

by Parity ( 12797 ) writes: on Tuesday February 24, 2004 @04:35PM (#8377417)

Actually, the 'too many cooks' problem has already been covered pretty thoroughly in The Mythical Man Month, but it does sound like this book might get a place beside MMM and be equally useful for steering managers.

--Parity None

Re:The first law of debugging (Score:3, Insightful)

by Marvin_OScribbley ( 50553 ) writes: on Tuesday February 24, 2004 @04:56PM (#8377624) Homepage Journal

"The most likely source of the current bug is the fix you made to the last one."

Actually that's a corollary to the first law, which is:
"Every bug fix will cause two more."

"Thats a feature" (Score:3, Insightful)

by peter303 ( 12292 ) writes: on Tuesday February 24, 2004 @05:08PM (#8377743)

I am not surprised at the number of so-called bugs that turn out to be holes in the specifications or tests. Then I tell the complaintant "thats the design specification". Then they say "no, thats not" and give me the updated specification.

In fact, popular bug-tracking databases like Scopus usually merge bugs and enhancement requests together, due to this ambiguity.

Re:Race Conditions? (Score:3, Insightful)

by Ben Hutchings ( 4651 ) writes: on Tuesday February 24, 2004 @06:23PM (#8378747) Homepage

The people who gave us Java are way too fond of threads, actually. (Want non-blocking sockets? Sorry, you'll have to add one thread per socket.) Most programmers still don't understand how and why to do synchronisation because they don't understand how weak modern memory models are (and have to be if processors are to continue accelerating). So while threads should not be banned they should also not be used without careful consideration of the consequences (more complex code and possibly reduced performance due to synchronisation) and the alternatives (multiple processes, asynchronous I/O, maybe no concurrency at all).

Two of my favourite rules... (Score:3, Insightful)

by rumblin'rabbit ( 711865 ) writes: on Tuesday February 24, 2004 @07:01PM (#8379296) Journal

Find the simplest possible run that replicates the error. My favourite strategy. It's really worth while doing this. Related to rule 4, perhaps, but not the same thing.
Examine the input data. Often it isn't a bug. Often the program is doing an entirely reasonable thing given the input data. Or perhaps the program mishandled bad input data (in which case it is a bug, but now you know what to look for).

Re:Heisenbugs... (Score:3, Insightful)

by kzinti ( 9651 ) writes: on Tuesday February 24, 2004 @07:34PM (#8379668) Homepage Journal

In my experience, Heisenbugs are often the result of race conditions between concurrent threads.

Yeah, but thread problems are so slippery, I don't even think of them as Heisenbugs. I think of them as Neutrinobugs.

A stack-related Heisenbug (or really any kind of Heisenbug, for that matter) will always occur in the same place, given the same conditions. Always the same location, always the same stack trace. But when you stick in a print statement, the bug moves, or - worse - it goes away altogether. That'll make you pull your hair out the first couple of times it happens to you, but after a while you learn to spot them pretty quickly.

Race conditions between threads, however, are maddening in their irregularity. They rarely happen in the same place at the same time. (If they do, you're lucky.) They can be random in when they choose to pop up. One time you might run five minutes before you see a crash. Next time, you might run hours before the program falls over and dies. And when you do get a crash, it's never in the same place, and often it's not even "near" the bad code. Two threads write to the same data structure at the same time because it wasn't locked correctly - and the program can continue running WAAAAY past the bad access. I've seen grown men brought to their knees, sobbing like little children, over threading problems. Race conditions keep suicide hotlines in business.

And people wonder why I'm compulsive about putting locks around my data.

Re:Change one thing at a time (Score:3, Insightful)

by e-Motion ( 126926 ) writes: on Tuesday February 24, 2004 @07:41PM (#8379752)

I particularly liked the rule about "Quit thinking and look". I worked with a guy who used what I call the "Zen method of debugging". He would keep staring at the code, trying to determine what was going on. I, on the other hand, would throw in some print statements so I could see what was going on.

Sometimes reading the code is enough. If you're good at reading code, then sometimes all you have to do is briefly look over what you wrote to spot the bug. YMMV, of course. If you've looked at the code for a few minutes and nothing looks obviously wrong, then it's probably time to use the debugger/add print statements. I've found that this is the most efficient way to go bug-hunting, because a quick re-read can find a lot of the "easy" bugs. This is similar to having a code review, but in this case you, the author, are the only reviewer. If there is another coder nearby, go ahead and ask him/her to give it a quick look as well, because he/she will probably have an easier time spotting the error.

I've met people who skip this step, and it drives me up the wall to see them waste their time (sometimes hours!) poking around in a debugger/writing print statements when the code they are debugging is simple. If it's a small, straightforward bit of code, then a quick look should uncover the bug. I suppose this falls under rule #1 (understand the system), but my point is more specific: understand the code.

None of the above is particularly groundbreaking, of course, and probably doesn't deserve to be mentioned in the book. These are more like "things you do before debugging".

Re: Heisenbugs... (Score:4, Insightful)

by gidds ( 56397 ) writes: <slashdot@@@gidds...me...uk> on Tuesday February 24, 2004 @08:06PM (#8380099) Homepage

Well, yes, but that determinism can be arbitrarily complex; causes may be very far removed from their effects. A GUI app can have a *lot* of past input to affect things, for example, especially if it runs for days or weeks. Exactly when asynchronous events happen can be extremely difficult to predict, detect, handle, or test; livelocks and race conditions are notoriously hard to track down. Even exact patterns of memory layout and allocation, file organisation or access, &c can affect subtle bugs. So while strictly true, determinism isn't a lot of help in some cases.

Missed one (Score:3, Insightful)

by dmiller ( 581 ) writes: <djm.mindrot@org> on Tuesday February 24, 2004 @11:13PM (#8382048) Homepage

There is one that appears to be left out (from the summary, perhaps not from the book - I haven't read it): fix it everywhere.

Once you have found a bug, search the rest of your tree for similar bugs. Chances are that you will find and fix several. This is especially true of bugs caused by bad assumptions.

FYI: This is one of the central audit methodologies of the OpenBSD project. It works much better for the BSDs as they keep the entire system in one CVS tree, rather than scattering it around FTP servers in the forms of tarballs. The whole system is readily available to search for entire classes of bugs.

The most important thing to prevent bugs (Score:3, Insightful)

by igomaniac ( 409731 ) writes: on Wednesday February 25, 2004 @05:03AM (#8384050)

I have a lot of experience in finding and fixing difficult bugs. In my experience, the most important thing you can do is when you find a bug, stop and think how you could have caught this bug automatically. If you practice this policy, you end up with very solid code. Basically, in the debug build, no function should ever crash the program no matter what garbage you put in the parameters - it should report an error and stop.

I think writing solid code is all in the attitude of the programmers - I had one guy who had a memory overwrite bug that was corrupting some characters in his string table when he called a certain function. Do you know how he fixed it? He wrote some code that put the right characters back over the corrupted ones after the call to this function!!! If you have that attitude, things WILL blow up in your face...

There may be more comments in this discussion. Without JavaScript enabled, you might want to turn on Classic Discussion System in your preferences instead.

Debugging 290

Debugging More Login

Debugging

Change one thing at a time (Score:5, Insightful)

He forgot regression tests (Score:5, Insightful)

Good read (Score:5, Insightful)

but how do you know it's fixed? (Score:5, Insightful)

Re:Hardware Debugging? (Score:5, Insightful)

Negative. (Score:1, Insightful)

Time (Score:5, Insightful)

Re:Hardware Debugging? (Score:5, Insightful)

Re:Hardware Debugging? (Score:3, Insightful)

Re:yuck (Score:2, Insightful)

Race Conditions? (Score:5, Insightful)

Re:Heisenbugs... (Score:5, Insightful)

I really liked the book, but I would have... (Score:5, Insightful)

Missed one: explain it to someone (Score:5, Insightful)

Missing rule (Score:3, Insightful)

Re:Heisenbugs... (Score:5, Insightful)

A missing rule (Score:5, Insightful)

Re:Good read (Score:5, Insightful)

An extra rule (Score:5, Insightful)

Fresh view: visit next lower level of abstraction (Score:4, Insightful)

Re:Now That It's Written Down (Score:3, Insightful)

Re:The first law of debugging (Score:3, Insightful)

"Thats a feature" (Score:3, Insightful)

Re:Race Conditions? (Score:3, Insightful)

Two of my favourite rules... (Score:3, Insightful)

Re:Heisenbugs... (Score:3, Insightful)

Re:Change one thing at a time (Score:3, Insightful)

Re: Heisenbugs... (Score:4, Insightful)

Missed one (Score:3, Insightful)

The most important thing to prevent bugs (Score:3, Insightful)

Related Links Top of the: day, week, month.

Slashdot Top Deals

Slashdot

Change one thing at a time (Score:5, Insightful)

He forgot regression tests (Score:5, Insightful)

Good read (Score:5, Insightful)

but how do you know it's fixed? (Score:5, Insightful)

Re:Hardware *Debugging*? (Score:5, Insightful)

Negative. (Score:1, Insightful)

Time (Score:5, Insightful)

Re:Hardware *Debugging*? (Score:5, Insightful)

Re:Hardware *Debugging*? (Score:3, Insightful)

Re:yuck (Score:2, Insightful)

Race Conditions? (Score:5, Insightful)

Re:Heisenbugs... (Score:5, Insightful)

I really liked the book, but I would have... (Score:5, Insightful)

Missed one: explain it to someone (Score:5, Insightful)

Missing rule (Score:3, Insightful)

Re:Heisenbugs... (Score:5, Insightful)

A missing rule (Score:5, Insightful)

Re:Good read (Score:5, Insightful)

An extra rule (Score:5, Insightful)

Fresh view: visit next lower level of abstraction (Score:4, Insightful)

Re:Now That It's Written Down (Score:3, Insightful)

Re:The first law of debugging (Score:3, Insightful)

"Thats a feature" (Score:3, Insightful)

Re:Race Conditions? (Score:3, Insightful)

Two of my favourite rules... (Score:3, Insightful)

Re:Heisenbugs... (Score:3, Insightful)

Re:Change one thing at a time (Score:3, Insightful)

Re: Heisenbugs... (Score:4, Insightful)

Missed one (Score:3, Insightful)

The most important thing to prevent bugs (Score:3, Insightful)

Related Links Top of the: day, week, month.

Slashdot Top Deals

Re:Hardware Debugging? (Score:5, Insightful)

Re:Hardware Debugging? (Score:5, Insightful)

Re:Hardware Debugging? (Score:3, Insightful)