850 likes | 1.07k Views
This page intentionally left blank. Please focus here. Measurement-based Research Methods in Computer Engineering. Mats Björkman Mälardalens Högskola. Overview. Introduction Experimental-based research methodology Statistics Measurements Methodology Examples Pitfalls Conclusions.
E N D
This page intentionally left blank Please focus here Mats Björkman, MdH
Measurement-based Research Methods in Computer Engineering Mats Björkman Mälardalens Högskola Mats Björkman, MdH
Overview • Introduction • Experimental-based research methodology • Statistics • Measurements • Methodology • Examples • Pitfalls • Conclusions Mats Björkman, MdH
Introduction • Measurement-based research is founded in: • Experimental research methodology • Statistics Mats Björkman, MdH
Experimental-based research methodology • Overview (repetition) • Comments Mats Björkman, MdH
Experimental-based research methodology - Overview • Already the old Greeks… • Two main standpoints: • Rational methods, it all comes from the brain, everything can be thought out • Idealistic methods, everything we observe give us knowledge about the ideal world (e.g. Plato) Mats Björkman, MdH
Different Methodologies • Rational research meant thinking, thus deductive (logical) methodologies • Idealistic research meant observing and drawing conclusions, thus inductive methodologies Mats Björkman, MdH
Practice often a mixture • E.g. Astronomy, a combination of induction from observations and deduction through e.g. Mathematics Mats Björkman, MdH
Medieval times • Much debate around whether or not God stands above the laws of logic • The question “Why?” important • Research always seen through the glasses of religion Mats Björkman, MdH
The Scientific Revolution • Bacon, Copernicus, Kepler, Newton etc. • Focus on “How?” • Nature and religion may be treated separately as long as focus is on How • Led to the development of the “traditional” sciences Mats Björkman, MdH
Modern Science • Karl Popper - important philosopher • Science is a process of testing and refining hypotheses • Induction problem: Can experience be generalized? • Popper says ‘no’, experiments cannot prove general hypotheses Mats Björkman, MdH
Modern Science • Falsification is the most important feature of science according to Popper • Hypotheses cannot be proven, but they can be falsified by counter-examples • Theories are compared by their expressiveness and by their abilities to withstand falsification Mats Björkman, MdH
Modern Science • Since hypotheses cannot be generally proven, corroboration and statistics play important roles • Hypothetical-deductive research methods build on these views of science • However, corroboration versus verification can be and is discussed Mats Björkman, MdH
Hypothetical-deductive methods • Problem formulation • Hypothesis • Deduction to find evaluation criteria • Experiment(s)/observation(s) • Conclusion (corroboration/verification or falsification) Mats Björkman, MdH
Problem formulation • The research problem is formulated • Typically, a good problem is addressable through empirical studies Mats Björkman, MdH
Hypothesis • A hypothesis regarding the answer to the research question is formulated • This is the really creative part of research, scientific intuition and a good “educated guess” are important to success • (Popper: The more “risky” the hypothesis, the “better” the result.) Mats Björkman, MdH
Deduction • From the hypothesis, criteria are deduced, the criteria to be used to test the hypothesis Mats Björkman, MdH
Experiments/observations • The “hard work” part of research • Experiments are set up and/or observations are performed in order to corroborate/verify (or falsify) the hypothesis Mats Björkman, MdH
Corroboration/verification or falsification • Using the deduced criteria on the results of the experiments/observations leads to either corroboration/verification or falsification of the hypothesis Mats Björkman, MdH
Then iterate… • Modern scientific research is typically a series of hypothetical-deductive situations; each corroboration/ verification or falsification gives input to a new or modified research question etc. etc. • Through this process, our scientific theories are expanded and refined Mats Björkman, MdH
What is “the Truth”? • Experimental research is often more quantitative than qualitative • For quantitative results, confidence levels or margins of errors are used in attempts to “encircle” the Truth (should it exist) • Experiments are repeated and/or modified until confidence levels or error margins are satisfactory Mats Björkman, MdH
What is “the Truth”? • For qualitative results, we must also use statistics. • Even if we believe in induction and that the Truth is possible to find, there are always experimental errors and the like that makes 100% impossible to reach • Here too, repeated experiments are needed Mats Björkman, MdH
Conclusions • Experimental research is an iterative process • Potential falsification is important: experiments without risks are not interesting Mats Björkman, MdH
Conclusions • Examples: If we know the outcome beforehand, the experiment is of no scientific value. • If there is no way to falsify the hypothesis (e.g. pseudoscience), the experiment is of no scientific value. Mats Björkman, MdH
Experimental-based research methodology - Comments • 100 % does not exist in reality! Mats Björkman, MdH
Experimental-based research methodology - Comments • In reality, there is always a residual chance/risk that something really weird will happen Mats Björkman, MdH
Experimental-based research methodology - Comments • Therefore, Popper and his followers are maybe not wrong, but it is kind of irrelevant whether a hypothesis can be “generally proven” or not • (I’m trying to be provocative here…) Mats Björkman, MdH
Experimental-based research methodology - Comments • If I show that in X percent of all cases, some hypothesis Y holds… • …then according to Popper, this cannot prove the general case… • …but if 1 – X (the risk of Y not holding) is smaller than e.g. the risk of the world being ended by a comet… • …then who cares? Mats Björkman, MdH
Experimental-based research methodology - Comments • In reality, we always take calculated risks • If a hypothesis is true for all practical purposes, then it is an academic question (a philosophical question) whether or not the hypothesis is TRUE Mats Björkman, MdH
Experimental-based research methodology - Conclusions • Conclusion: The hypothetical-deductive method is the modern methodology in experimental research • However, not everyone agrees that hypotheses cannot be generally proven (and others don’t care…) Mats Björkman, MdH
An Experimental Example • The work I did for my PhD thesis around the performance of parallel implementations of communication protocols Mats Björkman, MdH
Parallel TCP and UDP stacks • On a shared-memory multiprocessor we implemented parallel TCP/IP/Ethernet and UDP/IP/Ethernet stacks • The performance behavior of these stacks gave rise to the research question “What factors limit the performance of these parallel implementations?” Mats Björkman, MdH
Performance limiting factors • In parallel processing, critical resources must be protected from simultaneous access, in our case by using locks • Hence, these critical sections were main suspects as performance limiting factor Mats Björkman, MdH
Our research hypothesis • Our hypothesis then was “locking is the main performance limiting factor” • We built a performance model using only locking and processing • If our hypothesis was right, then the model should behave like the real system Mats Björkman, MdH
Experiment • Our experiment was to run the model with the same input as the real implementation and compare results Mats Björkman, MdH
Results • For the TCP stack, results were fairly accurate for low numbers of processors (but far from perfect). • Conclusion: locking “is probably” one major factor (but not the only) Mats Björkman, MdH
Results • For the UDP stack, results differed widely. • Conclusion: locking is not a major factor here Mats Björkman, MdH
Results from conclusions • We need to rethink and refine (iterate) • Locking obviously is one factor, but not the only • Need to think again and formulate a new hypothesis Mats Björkman, MdH
New hypothesis • Next to contention for shared software resources, contention for shared hardware resources (e.g. buses, memory) is a likely candidate • New hypothesis: Contention for locks and contention for the bus/memory system are the two main factors Mats Björkman, MdH
New model • We then built a new model that captured the effects of both locking and bus/memory contention • The same evaluation criteria as before, model and reality should agree Mats Björkman, MdH
New results • For the new model, the TCP results were very good Mats Björkman, MdH
New results • While not perfect, UDP results also showed that our new model captured the main behavior of the UDP stack Mats Björkman, MdH
New results • Conclusion: Lock and bus/memory contention “are” the two main performance limiting factors for the observed implementations Mats Björkman, MdH
Statistics Statistics are used for many purposes: • Quantify results • Measure confidence • Statistics needed in corroboration process Mats Björkman, MdH
Statistics – Result quantification • Assume we are measuring some property P that has a certain (but to us unknown) value V • When measuring, we get a measured value V* Mats Björkman, MdH
Statistics – Result quantification • How is V and V* related? • It depends on our measurement methods • Ideally, our measurement method gives an exact result, i.e. V* = V Mats Björkman, MdH
Statistics – Result quantification • However, most measurement methods are: • Statistical by their nature • Inexact • Deliberately simplified (model) Mats Björkman, MdH
Statistics – Measurement methods • Statistical by nature: • Sampling a typical example: • Instead of observing a long and possibly continuous process, we take a number of snapshots • These snapshots are statistically representative of the process Mats Björkman, MdH
Statistics – Measurement methods • Example: Counting cars (vehicles) • We want to know the number of cars/vehicles passing outside Rosenhill on one day • Instead of counting for 24 hours, we can count 10 randomly chosen minutes and multiply by 144. Mats Björkman, MdH
Statistics – Measurement methods • Inexactness: • If our measurement tools have lower resolution than the property we are measuring, we introduce measurement errors Mats Björkman, MdH