Ian Baldwin (Max Planck Institute): Making scientific writing painless
My name is Ian Baldwin. I'm a Director of the Max Planck Institute for Chemical Ecology in Jena, Germany. And I want to talk in this episode about one of the most painful processes that you have to master as a scientist, that is, writing scientific publications. And I hope I can give you some tips on how to make this process less painful. It's all written up in a little document that's on my home page. You can find it there. It's called the Baldwin formula for writing scientific papers and reviews, if you wanted to catch a written version of this. One of the points I want to make about writing is that it's an obligation. Basically, if you don't publish your research, you haven't done the research. So, you have to come to terms with this process of writing publications. One of the other points I want to make is that it is critical to the scientific process itself. Writing sharpens your critical thinking. And there is... it is really something that one has to do while you're experimenting. It's not something you do after you've finished experiments. It's a process that's integral to the experimentation process, in particular, honing what I called the red line, and I'll talk more about the red line in a few minutes. I believe that writing is like... something that I consider scientific oral hygiene. It is something, like brushing your teeth, you should do every day. And one should write in such a way that you reward yourself by being successful. And there's nothing worse than trying to write a manuscript from start to finish, having to confront the entire thing, so break it down into small bite-sized pieces that you can accomplish each day, and thereby stimulate that dopamine reward system that comes with accomplishment. Another point I want to make is that writing... scientific writing... it occurs in English. But science English is a foreign language. It's even a foreign language to native speakers. And therefore, it's a process that has to be practiced and honed as if it was a foreign language.
And the important part here is the clarity of the language, not the elegance with which you say things. It's clarity that counts in science. Another major challenge of scientific writing... and it's really true for all types of writing, but in particular science, where you have to be able to diagnose the logic that you have tried to encrypt in your language... and that is erasing the memory of what you have written. Because when you've written something, there's a memory of that, and your brain will say, yes, I understand what I have written, but you don't really see the words and the logic behind it, because of this particular memory that's echoing in your brain. And the solution that works for me, and for many of my students, is what I call the Men in Black solution, which is basically... if you've seen the movie, you know exactly what I'm talking about... this process of having some trigger that allows you to erase, completely, what you've just worked on in your memory. And there are lots of different ways of erasing your memory. You can use... you can print out what you've just written in a different font. That helps in breaking up your visual memory of it. You can have someone else read it out loud to you, and thereby the logic and the words come in through your ears, through a different sensory system, rather than through your eyes, and you can hear it as if you had written it for the first time, and be able to be critical of it. You can work on texts that you haven't worked on recently. And that's another way... just simply, the passage of time fades your memory. But for me, that's the process that works least effectively, and usually some of those other processes work more effectively in the erasing process. You can practice the editing skills on the words of others. And any time that you have the opportunity to review the work that someone else has done, you should jump at it, because a good editor becomes a good writer. These are very intertwined skills.
The hard part is just editing your own words. Now, the key to good scientific writing is to have a good red line. Now, the red line consists, basically, of three parts that you already know about a title, an abstract, and figures and captions and tables that... and the red line defines the logic of your paper. This red line has to be... has to be the most... it's the most fundamental part of your paper. If the logic of this line isn't good, everything else you write from here on has to... has to be done over again. So, I always argue that you should be spending about 70% of the time that you write a paper working on the logic of this particular red line. It is absolutely fundamental to your paper. Now, the way you start with the red line is to start with the abstract. Now, the abstract is not the abstract that you'll publish in a paper, but really a limited, 500 words or so narrative that defines the logic of your paper, and the arguments, and the order in which those arguments get presented. After you've written the abstract, you can highlight the key words in there and construct a title from that abstract. The title, of course, should not be filled with a lot of boring things like "the effect of this on that," but rather really encapsulate the main take-home message of your paper from the keywords in your abstract. Figures should not be a place where you simply dump data. Figures should represent one of five or six major take-home messages that you are presenting in the paper. The figure should be understandable in 30 seconds. And the figure should best be done in a handwritten form in the first draft. I always suggest that people write, in words, on the back of a piece of paper what they want that take-home message to be from the paper... from that.. from that particular figure, and then hand draw the figure, with the axes labeled nicely. And then, once you've hand drawn that figure, hand it to a naive colleague, a colleague who does not understand your story, and then find out, in 30 seconds, whether that colleague can understand what the take-home message is from that figure.
If he or she can't understand it in 30 seconds, then you need to go back and reconsider your figure. Now, sometimes you can get a figure to... to articulate a take-home message in 30 seconds with some simple tricks. So, for example, that 30 second rule is not gonna be matched by this particular figure, which is produced by commercial software, where there's a legend produced on the far axis, which gives you the information to understand the three different colored lines here. If you had simply labeled those lines directly, putting the labels right on the line, you could have picked up exactly what that figure says in 30 seconds. So, small changes in a figure can have a dramatic influence on how it takes a reader to understand what the take-home message is. Okay. So, now you have this red line. The red line is, as I say, the fundamental part of your paper. It's the part that you should be practicing and working on as you do experiments. This is not something that you wait until the end, because if you understand and you hone this red line, the rest of your paper will fall completely in place. So, with a good red line, you're now ready to start on the other parts of the paper. And papers are frequently broken down into different sections, like Introduction, Materials and Methods, and Results and Discussion, but not all journals have those separate sections. Sometimes they blend them all together. Sometimes they break 'em... break Results and Discussions into one package or another. So, you need to know the journal that you're targeting. Now, Materials and Methods deserve a little bit of a discussion here, because it is this part of a manuscript that has become smaller and smaller as journals have... particularly those journals that want to stay with a paper publication... have tried to go electronic. So, Materials and Methods are supposed to describe exactly how you've done the experiments so that someone else can repeat them and reproduce your results.
And reproducibility, after all, is the hall... hallmark of science, so you want to make sure that this part of the paper is done well. Now, as a consequence, most journals have made... because of the space limitations, they've made the Materials and Methods shorter and shorter, more and more telegraphic, harder and harder to understand, and moved most of the Materials and Methods into what's called the Supplemental Online Materials, the SOMs, and usually are... require extra digging to go find. Now, times are changing in the world of publications, and things are becoming electronic, and there are all these other venues to give detailed accounts of your Materials and Methods. For example, Bio-protocol is just one of them, where you can do complete videos of all the procedures that you use in your experiments and make it very easy for a scientist to replicate what you've done. So, take adva... take advantage of those opportunities, and make sure that your Materials and Methods really provide the right information that you need. Now, I want to talk also about the Introduction. Because the Introduction, particularly for a young scientist, is one of the biggest challenges, because you're just staring at this big piece of white paper, and you need to construct all the things that have gone into your mind that you think are necessary to introduce the work that you're gonna present. And that big blank piece of paper can be daunting. And frequently, when you start that process, you end up with a 30-page Introduction that no journal ever wants to publish. It has far too much in there. So, the best way to hone your activities and make the writing of an Introduction an efficient process is simply to start with the last paragraph of the introduction. That last paragraph, where you say, basically, here is the logic of the experiments, here's why we did these experiments, and.
.. and go through in a... in a distinct, concise way, summarizing, basically, the Introduction. When you've written that last paragraph, you can go through with your highlighter and just identify the concepts that need to be int... need to be introduced in the other parts of your introduction. So, basically, what you're doing is you're taking that last paragraph, and then you're reverse engineering the rest of the Introduction as defined by the key points in that last paragraph. This allows you to write a very efficient, compact Introduction without a lot of pain. The second part I want to talk about is the Results and Discussion section. In some journals, they require them to be blended together. But in other journals, they acquire them to be separate. And it's important to understand the differences between a result and the discussion. What goes into what, and what goes into the other? The results are not the same as the discussion. The results should present exactly those points that you... that logically arise from the data that you have, that don't require a discussion. The data is so good, the data is so solid, that you can simply conclude X from the data that you have. And so, they should draw logically from those results, they don't require a discussion, and they should follow the order that is depicted in the red line that you have. The Discussion, on the other hand, should be those points... those inferences that you make from your data that require discussion. The data is not so solid, so you can... so not... so not every reader is going to agree that you can draw the following inference or the following conclusion from the data than you have. So, that's what goes into the Discussion. That's the main meat of a Discussion part. And of course, the Discussion also includes interpretation of hypotheses. But the Discussion also allows you an opportunity to protect the integrity of the paper that you wanted to write, and to make sure that it's not.
.. becomes the paper that the reviewers want you to write. And therefore, the Discussion offers the opportunity to anticipate criticisms, to say, we understand that this experiment has other... other interpretations. And here, in the future, one could imagine doing the following experiment that would clarify it. That basically allows you to control what you consider being inside the scope of your paper and things that will be outside the scope of your paper. And it prevents reviewers from coming back at you, saying, you've got to do the ten extra follow... the following ten extra experiments before we'll be able to accept your paper. So, that is a point that is a very valuable thing to contribute to your Discussion, and it also allows you to sort of increase the level of scientific discourse about the manuscript that you have. So, those are the main points that I have about scientific writing. That it's an essential part of the critical thinking process. It's something that you should be doing while you're experimenting, particularly that red line. And it's something that you should do every day, particularly at the time when the rating... the writing part of your brain works best. For me, that's mornings, but for other people it's late at night. And you should just do your writing at the time when it's least painful. So, I hope these thoughts will help you be able to carry out this really important part of your professional obligation, which is to write up your experiments and have them ready for peer review. Thanks for your attention.
And the important part here is the clarity of the language, not the elegance with which you say things. It's clarity that counts in science. Another major challenge of scientific writing... and it's really true for all types of writing, but in particular science, where you have to be able to diagnose the logic that you have tried to encrypt in your language... and that is erasing the memory of what you have written. Because when you've written something, there's a memory of that, and your brain will say, yes, I understand what I have written, but you don't really see the words and the logic behind it, because of this particular memory that's echoing in your brain. And the solution that works for me, and for many of my students, is what I call the Men in Black solution, which is basically... if you've seen the movie, you know exactly what I'm talking about... this process of having some trigger that allows you to erase, completely, what you've just worked on in your memory. And there are lots of different ways of erasing your memory. You can use... you can print out what you've just written in a different font. That helps in breaking up your visual memory of it. You can have someone else read it out loud to you, and thereby the logic and the words come in through your ears, through a different sensory system, rather than through your eyes, and you can hear it as if you had written it for the first time, and be able to be critical of it. You can work on texts that you haven't worked on recently. And that's another way... just simply, the passage of time fades your memory. But for me, that's the process that works least effectively, and usually some of those other processes work more effectively in the erasing process. You can practice the editing skills on the words of others. And any time that you have the opportunity to review the work that someone else has done, you should jump at it, because a good editor becomes a good writer. These are very intertwined skills.
The hard part is just editing your own words. Now, the key to good scientific writing is to have a good red line. Now, the red line consists, basically, of three parts that you already know about a title, an abstract, and figures and captions and tables that... and the red line defines the logic of your paper. This red line has to be... has to be the most... it's the most fundamental part of your paper. If the logic of this line isn't good, everything else you write from here on has to... has to be done over again. So, I always argue that you should be spending about 70% of the time that you write a paper working on the logic of this particular red line. It is absolutely fundamental to your paper. Now, the way you start with the red line is to start with the abstract. Now, the abstract is not the abstract that you'll publish in a paper, but really a limited, 500 words or so narrative that defines the logic of your paper, and the arguments, and the order in which those arguments get presented. After you've written the abstract, you can highlight the key words in there and construct a title from that abstract. The title, of course, should not be filled with a lot of boring things like "the effect of this on that," but rather really encapsulate the main take-home message of your paper from the keywords in your abstract. Figures should not be a place where you simply dump data. Figures should represent one of five or six major take-home messages that you are presenting in the paper. The figure should be understandable in 30 seconds. And the figure should best be done in a handwritten form in the first draft. I always suggest that people write, in words, on the back of a piece of paper what they want that take-home message to be from the paper... from that.. from that particular figure, and then hand draw the figure, with the axes labeled nicely. And then, once you've hand drawn that figure, hand it to a naive colleague, a colleague who does not understand your story, and then find out, in 30 seconds, whether that colleague can understand what the take-home message is from that figure.
If he or she can't understand it in 30 seconds, then you need to go back and reconsider your figure. Now, sometimes you can get a figure to... to articulate a take-home message in 30 seconds with some simple tricks. So, for example, that 30 second rule is not gonna be matched by this particular figure, which is produced by commercial software, where there's a legend produced on the far axis, which gives you the information to understand the three different colored lines here. If you had simply labeled those lines directly, putting the labels right on the line, you could have picked up exactly what that figure says in 30 seconds. So, small changes in a figure can have a dramatic influence on how it takes a reader to understand what the take-home message is. Okay. So, now you have this red line. The red line is, as I say, the fundamental part of your paper. It's the part that you should be practicing and working on as you do experiments. This is not something that you wait until the end, because if you understand and you hone this red line, the rest of your paper will fall completely in place. So, with a good red line, you're now ready to start on the other parts of the paper. And papers are frequently broken down into different sections, like Introduction, Materials and Methods, and Results and Discussion, but not all journals have those separate sections. Sometimes they blend them all together. Sometimes they break 'em... break Results and Discussions into one package or another. So, you need to know the journal that you're targeting. Now, Materials and Methods deserve a little bit of a discussion here, because it is this part of a manuscript that has become smaller and smaller as journals have... particularly those journals that want to stay with a paper publication... have tried to go electronic. So, Materials and Methods are supposed to describe exactly how you've done the experiments so that someone else can repeat them and reproduce your results.
And reproducibility, after all, is the hall... hallmark of science, so you want to make sure that this part of the paper is done well. Now, as a consequence, most journals have made... because of the space limitations, they've made the Materials and Methods shorter and shorter, more and more telegraphic, harder and harder to understand, and moved most of the Materials and Methods into what's called the Supplemental Online Materials, the SOMs, and usually are... require extra digging to go find. Now, times are changing in the world of publications, and things are becoming electronic, and there are all these other venues to give detailed accounts of your Materials and Methods. For example, Bio-protocol is just one of them, where you can do complete videos of all the procedures that you use in your experiments and make it very easy for a scientist to replicate what you've done. So, take adva... take advantage of those opportunities, and make sure that your Materials and Methods really provide the right information that you need. Now, I want to talk also about the Introduction. Because the Introduction, particularly for a young scientist, is one of the biggest challenges, because you're just staring at this big piece of white paper, and you need to construct all the things that have gone into your mind that you think are necessary to introduce the work that you're gonna present. And that big blank piece of paper can be daunting. And frequently, when you start that process, you end up with a 30-page Introduction that no journal ever wants to publish. It has far too much in there. So, the best way to hone your activities and make the writing of an Introduction an efficient process is simply to start with the last paragraph of the introduction. That last paragraph, where you say, basically, here is the logic of the experiments, here's why we did these experiments, and.
.. and go through in a... in a distinct, concise way, summarizing, basically, the Introduction. When you've written that last paragraph, you can go through with your highlighter and just identify the concepts that need to be int... need to be introduced in the other parts of your introduction. So, basically, what you're doing is you're taking that last paragraph, and then you're reverse engineering the rest of the Introduction as defined by the key points in that last paragraph. This allows you to write a very efficient, compact Introduction without a lot of pain. The second part I want to talk about is the Results and Discussion section. In some journals, they require them to be blended together. But in other journals, they acquire them to be separate. And it's important to understand the differences between a result and the discussion. What goes into what, and what goes into the other? The results are not the same as the discussion. The results should present exactly those points that you... that logically arise from the data that you have, that don't require a discussion. The data is so good, the data is so solid, that you can simply conclude X from the data that you have. And so, they should draw logically from those results, they don't require a discussion, and they should follow the order that is depicted in the red line that you have. The Discussion, on the other hand, should be those points... those inferences that you make from your data that require discussion. The data is not so solid, so you can... so not... so not every reader is going to agree that you can draw the following inference or the following conclusion from the data than you have. So, that's what goes into the Discussion. That's the main meat of a Discussion part. And of course, the Discussion also includes interpretation of hypotheses. But the Discussion also allows you an opportunity to protect the integrity of the paper that you wanted to write, and to make sure that it's not.
.. becomes the paper that the reviewers want you to write. And therefore, the Discussion offers the opportunity to anticipate criticisms, to say, we understand that this experiment has other... other interpretations. And here, in the future, one could imagine doing the following experiment that would clarify it. That basically allows you to control what you consider being inside the scope of your paper and things that will be outside the scope of your paper. And it prevents reviewers from coming back at you, saying, you've got to do the ten extra follow... the following ten extra experiments before we'll be able to accept your paper. So, that is a point that is a very valuable thing to contribute to your Discussion, and it also allows you to sort of increase the level of scientific discourse about the manuscript that you have. So, those are the main points that I have about scientific writing. That it's an essential part of the critical thinking process. It's something that you should be doing while you're experimenting, particularly that red line. And it's something that you should do every day, particularly at the time when the rating... the writing part of your brain works best. For me, that's mornings, but for other people it's late at night. And you should just do your writing at the time when it's least painful. So, I hope these thoughts will help you be able to carry out this really important part of your professional obligation, which is to write up your experiments and have them ready for peer review. Thanks for your attention.