Bibliometrics (6): Extracting Bibliography Data from Web of Science database


Hi. In this video I will show you how to extract bibliographic data from the ISI Web of Science database. This bibliographic data can be used for citation analysis, co-citation and bibliographic coupling analysis, and also co-occurrence, co-word, and co-authorship analysis. You may also use it for different kinds of content analysis. Here I'll be using the Web of Science database. You can also use Scopus data or PubMed, and there are many other databases, but for now we'll focus on Web of Science. It is one of the largest, as you know: it has coverage since 1900, with more than 90 million records, and almost all the journals and articles indexed in SCI, SSCI, or the Emerging Sources Citation Index are listed here, so we'll have a wide range of data.

One important thing is that you need to have access to the ISI Web of Science database through your institution. Here I'm already on the Web of Science website, and you see we can search here. Just to give you an example, I'm taking two terms. For instance, if I want to do a citation analysis of theories in international business, I can write theory AND international business. I'm putting "theory" and "international business" in quotation marks so that only the articles which have exactly these two terms will show up. I click search, and I have about 774 articles.

That's nice. Maybe you are wondering whether all theories in international business are covered by this search, because sometimes theories are also referred to as frameworks. I mean, they are called frameworks, but they are used as theories as well, so those are ignored here. We'd like to add framework as well, so we can write theory OR framework. Now we will have the articles which have either theory or framework, and international business. We click search, and now we get about 1,126 articles.
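The two searches above can also be written down as strings, which makes it easier to see where the quotation marks and the OR group go. This is only a sketch: the helper names `any_of` and `all_of` are made up for illustration and are not part of Web of Science, and the resulting string is simply what you would type into the search box.

```python
def any_of(*terms):
    """Quote each term and join them with OR (parenthesised if more than one)."""
    quoted = [f'"{term}"' for term in terms]
    if len(quoted) == 1:
        return quoted[0]
    return "(" + " OR ".join(quoted) + ")"


def all_of(*groups):
    """Join already-built groups with AND."""
    return " AND ".join(groups)


# First search from the video: both terms must appear.
narrow = all_of(any_of("theory"), any_of("international business"))

# Second search: either "theory" or "framework", plus "international business".
broad = all_of(any_of("theory", "framework"), any_of("international business"))

print(narrow)  # "theory" AND "international business"
print(broad)   # ("theory" OR "framework") AND "international business"
```

Writing the OR group in parentheses keeps it from being mixed up with the AND part when more terms are added later.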

Okay, that's nice. Sometimes it can also happen that international business and international management are both relevant, and also strategy and organization studies. Maybe you are not interested in them, maybe you are. Just to show you, you can also add management. Not AND here; it should be OR management. I can put it like this: international goes here, then there should be an AND, so international AND business OR management. On the other side we have theory OR framework. So we click search again. Something is missing here: I missed the quotation mark. Now I get about 9,200 articles, and this is a lot for analysis.

Maybe you are not really interested in all these articles. Maybe you are interested only in the most cited articles. Here you can refine the results: you have hot papers and open access papers. Maybe you are interested in highly cited articles; then you can just refine by that. Maybe you are just interested in articles from the last two years, so you can just click here and refine by this. For now, I'll drop the management part and keep it limited to international business, so I erase it and search again. I get 1,126 articles.

You may also refine by fields like business, management, and economics, and you can refine by many other things; it depends on you. Usually when you do a bibliometric analysis you try to avoid reviews, editorials, proceedings, and book reviews, so we will refine by articles only. That way we get 998. There could be some other refinement criteria as well, like open access and authors, and there are many other options you can see here. There is also language.

We will actually use the language filter: we'll set it to English only. Now let's refine further and limit it to the last five years, so from 2015. So I have refined it to the last five years, and now we have only 498 articles. I'd like to extract these 498 articles. Just keep in mind that for your field there could be different search codes and different keywords, and you may want different refinement criteria. I'm just giving an example, so you have to play with it and find out what suits best in your case.

For convenience, I will show fifty per page. Before we really extract the data for all these 498 articles, we should actually go and read at least the abstract of each article, to understand whether it really fits our purpose, whether it really fits what you are trying to do in your research. In my case that means whether it is really related to theory and international business; it could be different things for your study. If an article doesn't fit, you can remove it from the list, I mean you just don't select that one. In my case, just to show you, I'll select all of them.

Here I have 50. I go to the next page and select all of them. If you want to exclude any of them, you just untick it. Yes, that's fine, it's not there. Maybe you don't want to have this one either, so you leave it out. But for now I'll keep all of them. Next page, I select all of them. Here you see the Marked List shows 100, so now it should be 150. Again I select all of them, so it's adding up in the Marked List: 200, then 100 more. I think I missed a bit: I missed this page, and it seems I also missed the page before. That's it.

I have all these articles in the Marked List. I go to the Marked List, where I will extract these articles for bibliometric analysis using some other software, and here I select all of them.
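Since it is easy to skip a page while selecting, as happened above, it can help to know what the Marked List counter should show after each page. The small sketch below works that out for 498 records at 50 per page; the function name `expected_marked_counts` is made up for illustration.

```python
import math


def expected_marked_counts(total_records, per_page):
    """Return the Marked List size expected after selecting each page in turn."""
    pages = math.ceil(total_records / per_page)
    return [min(page * per_page, total_records) for page in range(1, pages + 1)]


counts = expected_marked_counts(498, 50)
print(len(counts))  # 10 pages in total
print(counts[:3])   # [50, 100, 150]
print(counts[-1])   # 498 once every page has been selected
```

If the counter on screen is lower than the value for the page you are on, a page was missed somewhere before it.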

You can choose which ones you want to select and which not, but I'm selecting all of them. Then you click here on save to other formats. You may choose a different format, but I choose plain text because I will be using this format with other software. Send, and it should be downloading right now. Yes, here it is, it's downloading. You see, it looks like this: it's a text file with a lot of information that we cannot really read, and that's fine.

Before we can use it with other software, a few things have to be done. We have downloaded all the marked articles, and now we have to re-save the downloaded file: we have to set the encoding to ANSI, and when we save it we also have to replace the first line with this one. So I copy the first line from here and I replace it here. Now Save As: I'll rename the file, with 498 in the name as well, change the encoding to ANSI, and save it. That's fine.

Just to show you that the data works, I have opened the HistCite software here, and I will open that file: browse, add file, and it should work. Nice, you see it works. Here we have 498 articles, written by 1,114 authors in 163 journals, with a lot of cited references and a lot of keywords.

Thank you for watching. If you find it useful, like, share, and comment. Bye bye.
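The re-saving step described above (replace the first line, save with ANSI encoding) can also be done with a short script instead of a text editor. This is a minimal sketch under several assumptions: "ANSI" is taken to mean the Windows-1252 code page (`cp1252`), the download is assumed to be UTF-8, and each record in the plain-text export is assumed to begin with a `PT ` line. The function name `resave_for_import`, the file names, the sample records, and the placeholder header are all made up for illustration; the real replacement line is the one shown in the video and is not reproduced here.

```python
import tempfile
from pathlib import Path


def resave_for_import(src, dst, new_first_line, encoding="cp1252"):
    """Replace the first line of a plain-text export and re-save it in `encoding`.

    Returns the number of records found, counted as lines starting with "PT ".
    """
    # "utf-8-sig" reads UTF-8 with or without a byte-order mark.
    lines = Path(src).read_text(encoding="utf-8-sig").splitlines()
    if not lines:
        raise ValueError("the export file is empty")
    lines[0] = new_first_line
    # Characters that the target code page cannot represent become "?".
    Path(dst).write_text("\n".join(lines) + "\n", encoding=encoding, errors="replace")
    return sum(1 for line in lines if line.startswith("PT "))


# Tiny made-up export with two records, only to show the function running.
sample = (
    "FN Original header line\nVR 1.0\n"
    "PT J\nAU Doe, J\nTI Café example\nER\n\n"
    "PT J\nAU Roe, R\nER\n\nEF\n"
)

with tempfile.TemporaryDirectory() as tmp:
    src = Path(tmp) / "savedrecs.txt"        # assumed download name
    dst = Path(tmp) / "savedrecs_ansi.txt"   # hypothetical output name
    src.write_text(sample, encoding="utf-8")
    n_records = resave_for_import(src, dst, "FN Placeholder header line")
    first_line = dst.read_text(encoding="cp1252").splitlines()[0]

print(n_records)   # 2
print(first_line)  # FN Placeholder header line
```

For the file from the video, the returned record count should match the 498 articles in the Marked List; a different number would suggest that a page was missed or the download was incomplete.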