320 likes | 547 Views
Bibliometric research methods. Faculty Brown Bag IUPUI Cassidy R. Sugimoto. Overview . Vocabularly Citation analysis Citation indices Bibliometric laws Impact factor Applications. Vocabulary. Scholarly Communications Formal and information Scientometrics Scientific communication
E N D
Bibliometric research methods Faculty Brown Bag IUPUI Cassidy R. Sugimoto
Overview • Vocabularly • Citation analysis • Citation indices • Bibliometric laws • Impact factor • Applications
Vocabulary • Scholarly Communications • Formal and information • Scientometrics • Scientific communication • Infometrics • Thinking beyond scholarly “texts” • Webometrics • web • Bibliometrics • Application of statistical and mathematical methods (formal channels)
Citation analysis • Why do people cite? • Why are some articles not cited? • What does a citation mean? Cited document Citing document A B A references B B is cited by A
Who’s on first? Embedded citation index from ` En mishpat: Babylonian Talmud (1546) (Weinberg, 1997) Shepard’s Citation Index (1873) Shapiro (1992)
Scopus n=7,333 (86%) Web of Science n=6,108 (71%) Scopus 29% (2,441) Overlap 57% (4,892) Web of Science 14% (1,216) Comparison Distribution of unique and overlapping citations in Scopus and Web of Science (n=8,549)
Cited document Citing document A B A references B B is cited by A Citation analysis
Citation analysis: methods Not just articles…
Hybrid approaches Chaomei Chen: http://www.pages.drexel.edu/~cc345/citespace/figures/terrorism1990-2003-300dpi.png
h-index • Hirsch (2005) • A scientist has index h if h of [his/her] Np papers have at least h citations each, and the other (Np − h) papers have at most h citations each.
Bibliometric laws • Lotka’s Law (1926) the number (of authors) making n contributions is about 1/n² of those making one; and the proportion of all contributors, that make a single contribution, is about 60 percent (60,15,7…6>10) Not statistically exact May be changing with the current model of scholarship
Bibliometric laws • Bradford’s law (1934) • Journals in a field can be divided into three parts: • Core: relatively few # of journals producing 1/3 of all articles • Zone 2: same # of articles, but > # of journals • Zone 3: same # of articles, but > # of journals • The mathematical relationship of the number of journals in the core to the first zone is a constant n and to the second zone the relationship is n². 1:n:n² Not statistically exact General power law distribution (akin to Pareto’s law in economics)
Bibliometric laws • Zipf’s Law (1935) listing the words occurring within that text in order of decreasing frequency, the rank of a word on that list multiplied by its frequency will equal a constant. The equation for this relationship is: r x f = k where r is the rank of the word, f is the frequency, and k is the constant James Joyce's Ulysses 10th most frequent: 2,653 times 100th most frequent: 265 times 200th most frequent: 133 times rank of the word multiplied by the frequency of the word equals a constant that is approximately 26,500 Not statistically exact General power law probability distribution
Bibliometric laws • Other power law probability distributions • Pareto’s law (economics) • 80-20 rule • Law of the vital few • Principle of factor sparsity • PageRank (google) • The Long Tail (markets)
As a research method… • Reliability? • Validity? • Limitations?
Applications? • Finding and use • Collection development • Reference services • Collection evaluation • Use studies • Information retrieval algorithms • Diffusion of ideas • Domain areas and interdisciplinarity • Mapping science