Introduction to Applied Statistical Analysis
A practical course for handling datasets
Statistical analysis
• The science of collecting, exploring and presenting large amounts of data to discover underlying patterns and trends
• The first step is basic exploratory data analysis (B.E.D.A.)
• B.E.D.A. has two main roles:
• 1. to define basic statistical values
• 2. to visualize data
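As an illustration of the first role (defining basic statistical values), here is a minimal sketch in Python using only the standard `statistics` module. The course itself works in Origin, so this is a swapped-in tool, and the sample values and the helper name `describe` are hypothetical, invented only for this example.

```python
import statistics

# Hypothetical sample values, invented purely for illustration.
values = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]


def describe(data):
    """Return basic descriptive statistics for a list of numbers."""
    # quantiles(..., n=4) returns the three quartile cut points.
    q1, _median, q3 = statistics.quantiles(data, n=4)
    return {
        "n": len(data),
        "mean": statistics.mean(data),
        "median": statistics.median(data),
        "stdev": statistics.stdev(data),  # sample standard deviation (n - 1)
        "min": min(data),
        "max": max(data),
        "q1": q1,
        "q3": q3,
    }


summary = describe(values)
for name, value in summary.items():
    print(f"{name}: {value}")
```

Note that `statistics.stdev` divides by n - 1 (sample standard deviation); `statistics.pstdev` would give the population value instead.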
Everyday statistics (you can find other facts at http://www.tylervigen.com/)
Software for statistical analysis
• Open-source statistical packages
• Public-domain statistical packages
• Freeware statistical packages
• Proprietary statistical packages (e.g. OriginPro)
• Microsoft Excel is not for this purpose!
Why don’t we use Microsoft Excel for analysis?
• Basic functions and formulas (for getting average and deviation)
• Data visualization with graphs and diagrams
• Seems impressive (with graphical settings)
• Mathematically inaccurate (I’ll show you)
• Scientific content can hardly be presented in some cases
• Analysis ToolPak is needed
• This program was not developed for data analysis
Practice session: objectives for Origin
• Objective 1. statistical analysis, basic plot representations (curve, column, waterfall), linear and nonlinear curve fitting, box plot
• Objective 2. histogram, distribution curve, scatter plot, Q-Q plot, axis value settings, colour code modifications
• Objective 3. making an enthalpy profile, solid vs. dashed line, other axis properties, adding text to a graph
• Objective 4. dataset conversion to matrix, matrix representation with different surface techniques
+ Graph export to picture (at the end, for every objective)
Objective 1
• Statistical analysis, basic plot representations (curve, column, waterfall), linear and nonlinear curve fitting, box plot
• Dataset: vibrational frequencies of opiorphin (QRFSR), in all its epimers
• For fitting: rotation of Asn
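The linear curve fitting named in this objective can be sketched outside Origin as an ordinary least-squares fit. The Python below is an assumption-laden illustration, not Origin's own fitting routine: the function name `linear_fit` and the data points are hypothetical, and the points are chosen to lie exactly on y = 2x + 1 so the result is easy to check.

```python
def linear_fit(x, y):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sums of squares and cross-products about the means.
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    # Coefficient of determination R^2 = 1 - SS_res / SS_tot.
    ss_res = sum((yi - (slope * xi + intercept)) ** 2 for xi, yi in zip(x, y))
    ss_tot = sum((yi - mean_y) ** 2 for yi in y)
    r_squared = 1.0 - ss_res / ss_tot
    return slope, intercept, r_squared


# Hypothetical points lying exactly on y = 2x + 1 (not course data).
x = [0.0, 1.0, 2.0, 3.0, 4.0]
y = [1.0, 3.0, 5.0, 7.0, 9.0]
slope, intercept, r2 = linear_fit(x, y)
print(f"slope={slope}, intercept={intercept}, R^2={r2}")
```

Nonlinear fitting generally needs an iterative optimizer and is left out of this sketch.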
Objective 2
• Histogram, distribution curve, scatter plot, Q-Q plot, axis value settings, colour code modifications
• Dataset: potential energies (in kcal/mol), radii of gyration and number of H-bonds in opiorphin epimers
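To show what a histogram and a normal Q-Q plot compute before anything is drawn, here is a rough Python sketch using only the standard library. It is not how Origin implements these plots; the helper names `histogram` and `qq_points`, the sample values, and the plotting-position convention (i + 0.5) / n are all assumptions made for this example.

```python
from statistics import NormalDist, mean, stdev


def histogram(data, n_bins):
    """Count values in n_bins equal-width bins spanning [min, max].

    Assumes the data are not all identical (bin width must be non-zero).
    """
    lo, hi = min(data), max(data)
    width = (hi - lo) / n_bins
    counts = [0] * n_bins
    for v in data:
        # The maximum value is assigned to the last bin.
        idx = min(int((v - lo) / width), n_bins - 1)
        counts[idx] += 1
    return counts


def qq_points(data):
    """Pair theoretical normal quantiles with the sorted sample values."""
    ordered = sorted(data)
    n = len(ordered)
    fitted = NormalDist(mean(ordered), stdev(ordered))
    # Plotting positions (i + 0.5) / n are one common convention.
    return [(fitted.inv_cdf((i + 0.5) / n), v) for i, v in enumerate(ordered)]


# Hypothetical sample, invented for illustration only.
sample = [1.0, 2.0, 2.0, 3.0, 3.0, 3.0, 4.0, 4.0, 5.0]
counts = histogram(sample, 4)
points = qq_points(sample)
print(counts)
for theoretical, observed in points:
    print(f"{theoretical:.3f}  {observed}")
```

If the sample is close to normally distributed, the (theoretical, observed) pairs fall near the line y = x.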
Objective 3
• Making an enthalpy profile, solid vs. dashed line, other axis properties, adding text to a graph
• Dataset: enthalpies (in kJ/mol) for the reaction system of HCN + OH radical
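An enthalpy profile is usually drawn relative to a reference state, so one likely preparation step is shifting every enthalpy so that the reference sits at zero. The Python sketch below assumes that convention; the function name `relative_profile`, the state labels and the numbers are hypothetical placeholders, not values from the HCN + OH dataset.

```python
def relative_profile(enthalpies, reference):
    """Shift every enthalpy so that the reference state sits at zero."""
    zero = enthalpies[reference]
    return {state: h - zero for state, h in enthalpies.items()}


# Hypothetical placeholder states and values in kJ/mol (not course data).
absolute = {
    "reactants": -100.0,
    "transition state": -80.0,
    "products": -150.0,
}
profile = relative_profile(absolute, "reactants")
for state, dh in profile.items():
    print(f"{state}: {dh:+.1f} kJ/mol")
```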
Objective 4
• Dataset conversion to matrix, matrix representation with different surfaces
• Dataset: χ1, χ2 and ∆rE (in hartree)
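Converting a three-column dataset to a matrix amounts to placing each z value in a grid indexed by its x and y values. The Python below is a minimal sketch of that idea, not Origin's conversion routine; the function name `xyz_to_matrix` and the numbers are hypothetical, with x and y standing in for χ1 and χ2 and z for ∆rE.

```python
def xyz_to_matrix(rows):
    """Convert (x, y, z) triples into sorted axes and a 2-D grid of z values."""
    xs = sorted({x for x, _, _ in rows})
    ys = sorted({y for _, y, _ in rows})
    col = {x: j for j, x in enumerate(xs)}
    row = {y: i for i, y in enumerate(ys)}
    # Cells with no matching data point stay None.
    grid = [[None] * len(xs) for _ in ys]
    for x, y, z in rows:
        grid[row[y]][col[x]] = z
    return xs, ys, grid


# Hypothetical values, invented for illustration only.
data = [
    (0.0, 0.0, 0.10),
    (180.0, 0.0, 0.20),
    (0.0, 180.0, 0.30),
    (180.0, 180.0, 0.40),
]
xs, ys, grid = xyz_to_matrix(data)
print("columns (x):", xs)
print("rows (y):", ys)
for line in grid:
    print(line)
```

Each grid row corresponds to one y value and each column to one x value, which is the layout surface and contour plots typically expect.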