Dokumentation von PhyPraKit¶

Übersicht:

PhyPraKit ist eine Sammlung von Funktionen in der Sprache python (vers. 2.7) zur Visualisierung und Auswertung von Daten in den physikalischen Praktika. Beispiele illustrieren jeweils die Anwendung.

Version der Dokumentation vom Okt. 2016

Indices and tables¶

Darstellung und Auswertung von Messdaten¶

In allen Praktika zur Physik werden Methoden zur Darstelllung und Auswertung von Messdaten benötigt. Die Script- und Programmiersprache python mit den Zusatzpaketen numpy und matplotlib ist ein universelles Werkzeug, um die Wiederholbarkeit und Reprodzierbarkeit von Datenauswertungen zu gewährleiseten.

In der Veranstaltung “Computergestützte Datenauswertung” (http://www.ekp.kit.edu/~quast/CgDA), die im neuen Studienplan für den Bachelorstudiengang Physik am KIT seit dem Sommersemester 2016 angeboten wird, werden Methoden und Software zur grafischen Darstellung von Daten, deren Modellierung und Auswertung eingeführt.

Die folgen Links erlauben einen schnellen Überblick:

Zusammenfassung der Vorlesung und Dokumentation der Code-Beispiele

http://www.ekp.kit.edu/~quast/CgDA/CgDA-html/CgDA_ZusFas.html

Installation der Software auf verschiedenen Platformen

Dokumentation in html: http://www.ekp.kit.edu/~quast/CgDA/CgDA-SoftwareInstallation-html

Dokumentation in pdf: http://www.ekp.kit.edu/~quast/CgDA/CgDA-SoftwareInstallation.pdf

Softwarepakete: http://www.ekp.kit.edu/~quast/CgDA/Software

Speziell für das “Praktikum zur klassischen Physik” finden sich eine kurze Einführung (http://www.ekp.kit.edu/~quast/CgDA/PhysPrakt/CgDA_APraktikum.pdf) sowie die hier dokumentierten einfachen Beispiele als Startpunkt für eigene Auswertungen (http://www.ekp.kit.edu/~quast/CgDA/PhysPrakt/).

Dokumentation der Beispiele¶

PhyPraKit.py ist ein Paket mit nützlichen Hilfsfunktionen

zum import in eigene Beispielen mittels:
import PhyPraKit as ppk
from PhyPraKit import ...

test_readColumnData.py ist ein Beispiel zum Einlesen von Spalten aus Textdateien; die zugehörigen Metadaten können ebenfalls an das Script übergeben werden und stehen so bei der Auswsertung zur Verfügung.
test_readPicoScope liest Ausgabedateien von PicoScpe
test_labxParser liest Ausgabedateien von Leybold CASSY im .labx-Format. Die Kopfzeilen und Daten von Messreihen werden als Listen in python zur Verfügung gestellt.
test_kRegression.py dient zur Anpassung einer Geraden an Messdaten mit Fehlern in Ordinaten- und Abszissenrichtung und mit allen Messpunkten gemeinsamen (d. h. korrelierten) relativen oder absoluten systematischen Fehlern mit dem Paket kafe.
test_linRegression.py ist eine einfachere Version mit python-Bordmitteln zur Anpassung einer Geraden an Messdaten mit Fehlern in Ordinaten- und Abszissenrichtung. Korrelierte Unsicherheiten werden nicht unterstützt.
test_kFit.py ist eine verallgemeinerte Version von test_kRegression und dient zur Anpassung einer beliebigen Funktion an Messdaten mit Fehlern in Ordinaten- und Abszissenrichtung und mit allen Messpunkten gemeinsamen (d. h. korrelierten) relativen oder absoluten systematischen Fehlern mit dem Paket kafe.
test_Histogram.py ist ein Beispiel zur Darstellung und statistischen Auswertung von Häufigkeitsverteilungen (Histogrammen) in einer und zwei Dimensionen.
test_generateXYata.py zeigt, wie man mit Hilfe von Zufallszahlen “künstliche Daten” zur Veranschaulichung oder zum Test von Methoden zur Datenauswertung erzeugen kann.

Weitere python-Skripte als Beisepiele zur Anwendung von Modulen in PhyPraKit:

kfitf.py ist ein Kommandozeilen-Werkzeug, mit dem man komfortabel Anpassungen ausführen kann, bei denen Daten und Fit-Funktion in einer einzigen Datei angegeben werden. Beispiele finden sich in den Dateien mit der Endung .fit.
Beispiel_Drehpendel.py demonstriert die Analyse von am Drehpendel mit CASSY aufgenommenen Daten. Enthalten sind einfache Funktionen zum Filtern und bearbeiten der Daten, Suche nach Extrema, zur Anpassung einer Einhüllenden, zur diskreten Fourier-Transformation und zur Interpolation von Messdaten mit kubischen Spline-Funktionen.
Beispiel_Hysterese.py demonstriert die Analyse von Daten, die mit einem USB-Oszilloskop der Marke PicoScope am Versuch zur Hysterese aufgenommen wurden. Die aufgezeichneten Werte für Strom und B-Feld werden in einen Zweig für steigenden und fallenden Strom aufgeteilt, mit Hilfe von kubischen Splines interpoliert und dann integriert.
Beispiel_Wellenform.py zeigt eine typische Auswertung

periodischer Daten am Beispiel der akustischen Anregung eines Metallstabs. Genutzt werden Fourier-Transformation und eine Suche nach charakteristischen Extrema. Die Zeitdifferenzen zwischen deren Auftreten im Muster werden bestimmt, als Häufgkeitsverteilung dargestellt und die Verteilungen statistisch ausgewertet.

Die übrigen python-Scripte im Verzeichnis wurden zur Erstellung der in der einführenden Vorlesung gezeigten Grafiken verwendet.

Für die Erstellung von Protokollen mit Tabellen, Grafiken und Formeln bietet sich das Textsatz-System LaTeX an. Die Datei Protokollvorlage.zip enthält eine sehr einfach gehaltene Vorlage, die für eigene Protokolle verwendet werden kann. Eine sehr viel umfangreichere Einführung sowie ein ausführliches Beispiel bietet die Fachschaft Physik unter dem Link https://fachschaft.physik.kit.edu/drupal/content/latex-vorlagen

Modul-Dokumentation¶

Collection of Tools for Data Analysis, contains the follwoing functions:

Data input:

readColumnData() read data and meta-data from text file

readPicoScope() read data from PicoScope

readCassy() read CASSY output file in .txt format

labxParser() read CASSY output file, .labx format

signal processing:

offsetFilter subtract an offset in array a

meanFilter apply sliding average to smoothen data

resample average over n samples

Fourier_fft fast Fourier transformation of an array

FourierSpectrum Fourier transformation of an array (slow, preferably use fft version)

simplePeakfinder find peaks and dips in an array comparing neighbouring samples use convolutionPeakfinder

convolutionPeakfinder find peaks and dips in an array

statistics:

wmean() weighted mean

histograms tools:

barstat() statistical information bar chart

nhist() histogram plot based on np.historgram() and plt.bar() use matplotlib.pyplot.hist() instead

histstat() statistical information from 1d-histogram

nhist2d() 2d-histotram plot based on np.histrogram2d, plt.colormesh() use matplotlib.pyplot.hist2d() instead

hist2dstat() statistical information from 1d-histogram

profile2d() “profile plot” for 2d data

chi2p_indep2d() chi2 test on independence of data

linear regression:

linRegression() linear regression, y=ax+b, with analytical formula

linRegressionXY() linear regression, y=ax+b, with x and y errors ! deprecated, use `odFit` with linear model instead

odFit() fit function with x and y errors (scipy ODR)

kRegression() regression, y=ax+b, with x-, y- and correlated errors ! deprecated, use `kFit` with linear model instead

kFit() fit function with x-, y- and correlated errors (kafe)

simulated data with MC-method:

smearData() add random deviations to input data

generateXYdata() generate simulated data

PhyPraKit.FourierSpectrum(t, a, fmax=None)¶

Fourier transform of amplitude spectrum a(t), for equidistant sampling times

(a simple implementaion for didactical purpose only, consider using Fourier_fft() )

Args:

t: np-array of time values

a: np-array amplidude a(t)

Returns:

arrays freq, amp: frequencies and amplitudes

PhyPraKit.Fourier_fft(t, a)¶

Fourier transform of the amplitude spectrum a(t)

method:

uses numpy.fft and numpy.fftfreq; output amplitude is normalised to number of samples;

Args:

t: np-array of time values
a: np-array amplidude a(t)

Returns:

arrays f, a_f: frequencies and amplitudes

PhyPraKit.barstat(bincont, bincent, pr=True)¶

statistics from a bar chart (histogram) with given bin contents and bin centres

Args:

bincont: array with bin content
bincent: array with bin centres

Returns:

float: mean, sigma and sigma on mean

PhyPraKit.chi2p_indep2d(H2d, bcx, bcy, pr=True)¶

perform a chi2-test on independence of x and y

Args:

H2d: histogram array (as returned by histogram2d)
bcx: bin contents x
bcy: bin contents y

Returns:

float: p-value w.r.t. assumption of independence

PhyPraKit.convolutionPeakfinder(a, width=10, th=0.1)¶

find positions of all Peaks and Dips in data

(simple version for didactical purpose, consider using scipy.signal.find_peaks_cwt() )

method:

convolute array a with signal template of given width and return extrema of convoluted signal, i.e. places where template matches best

Args:

a: array-like, input data
width: int, width of signal to search for
th: float, relative threshold for peaks above minimum

Returns:

pidx: list, indices in original array of peaks

PhyPraKit.generateXYdata(xdata, model, sx, sy, mpar=None, srelx=None, srely=None, xabscor=None, yabscor=None, xrelcor=None, yrelcor=None)¶

Generate measurement data according to some model assumes xdata is measured within the given uncertainties; the model function is evaluated at the assumed “true” values xtrue, and a sample of simulated measurements is obtained by adding random deviations according to the uncertainties given as arguments.

Args:

xdata: np-array, x-data (independent data)
model: function that returns (true) model data (y-dat) for input x
mpar: list of parameters for model (if any)

the following are single floats or arrays of length of x

sx: gaussian uncertainty(ies) on x
sy: gaussian uncertainty(ies) on y
srelx: relative gaussian uncertainty(ies) on x
srely: relative gaussian uncertainty(ies) on y

the following are common (correlated) systematic uncertainties

xabscor: absolute, correlated error on x
yabscor: absolute, correlated error on y
xrelcor: relative, correlated error on x
yrelcor: relative, correlated error on y

Returns:

np-arrays of floats:
- xtrue: true x-values
- ytrue: true value = model(xtrue)
- ydata: simulated data

PhyPraKit.hist2dstat(H2d, xed, yed, pr=True)¶

calculate statistical information from 2d Histogram

Args:

H2d: histogram array (as returned by histogram2d)
xed: bin edges in x
yed: bin edges in y

Returns:

float: mean x
float: mean y
float: variance x
float: variance y
float: covariance of x and y
float: correlation of x and y

PhyPraKit.histstat(binc, bine, pr=True)¶

calculate mean of a histogram with bincontents binc and bin edges bine

Args:

binc: array with bin content
bine: array with bin edges

Returns:

float: mean, sigma and sigma on mean

PhyPraKit.kFit(func, x, y, sx, sy, p0=None, p0e=None, xabscor=None, yabscor=None, xrelcor=None, yrelcor=None, title='Daten', axis_labels=['X', 'Y'], plot=True, quiet=False)¶

fit function func with errors on x and y; uses package kafe

Args:

func: function to fit

x: np-array, independent data

y: np-array, dependent data

the following are single floats or arrays of length of x

sx: uncertainty(ies) on x

sy: uncertainty(ies) on y

p0: array-like, initial guess of parameters

p0e: array-like, initial guess of parameter uncertainties

xabscor: absolute, correlated error(s) on x

yabscor: absolute, correlated error(s) on y

xrelcor: relative, correlated error(s) on x

yrelcor: relative, correlated error(s) on y

title: string, title of gaph

axis_labels: List of strings, axis labels x and y

plot: flag to switch off graphical ouput

quiet: flag to suppress text and log output

Returns:

np-array of float: parameter values
np-array of float: parameter errors
np-array: cor correlation matrix
float: chi2 chi-square

PhyPraKit.kRegression(x, y, sx, sy, xabscor=None, yabscor=None, xrelcor=None, yrelcor=None, title='Daten', axis_labels=['X', 'Y'], plot=True, quiet=False)¶

linear regression y(x) = ax + b with errors on x and y; uses package kafe

Args:

x: np-array, independent data

y: np-array, dependent data

the following are single floats or arrays of length of x

sx: uncertainty(ies) on x

sy: uncertainty(ies) on y

xabscor: absolute, correlated error(s) on x

yabscor: absolute, correlated error(s) on y

xrelcor: relative, correlated error(s) on x

yrelcor: relative, correlated error(s) on y

title: string, title of gaph

axis_labels: List of strings, axis labels x and y

plot: flag to switch off graphical ouput

quiet: flag to suppress text and log output

Returns:

float: a slope
float: b constant
float: sa sigma on slope
float: sb sigma on constant
float: cor correlation
float: chi2 chi-square

PhyPraKit.labxParser(file, prlevel=1)¶

read files in xml-format produced with Leybold CASSY

Args:

file: input data in .labx format
prlevel: control printout level, 0=no printout

Returns:

list of strings: tags of measurmement vectors
2d list: measurement vectors read from file

PhyPraKit.linRegression(x, y, sy)¶

linear regression y(x) = ax + b

method:

analytical formula

Args:

x: np-array, independent data
y: np-array, dependent data
sx: np-array, uncertainty on y

Returns:

float: a slope
float: b constant
float: sa sigma on slope
float: sb sigma on constant
float: cor correlation
float: chi2 chi-square

PhyPraKit.linRegressionXY(x, y, sx, sy)¶

linear regression y(x) = ax + b with errors on x and y uses numerical “orthogonal distance regression” from package scipy.odr

Args:

x: np-array, independent data
y: np-array, dependent data
sx: np-array, uncertainty on y
sy: np-array, uncertainty on y

Returns:

float: a slope
float: b constant
float: sa sigma on slope
float: sb sigma on constant
float: cor correlation
float: chi2 chi-square

PhyPraKit.meanFilter(a, width=5)¶

apply a sliding average to smoothen data,

method:

value at index i and int(width/2) neighbours are averaged to from the new value at index i

Args:

a: np-array of values
width: int, number of points to average over (if width is an even number, width+1 is used)

Returns:

av smoothed signal curve

PhyPraKit.nhist(data, bins=50, xlabel='x', ylabel='frequency')¶

Histogram.hist show a one-dimensional histogram

Args:

data: array containing float values to be histogrammed
bins: number of bins
xlabel: label for x-axis
ylabel: label for y axix

Returns:

float arrays bin content and bin edges

PhyPraKit.nhist2d(x, y, bins=10, xlabel='x axis', ylabel='y axis', clabel='counts')¶

Histrogram.hist2d create and plot a 2-dimensional histogram

Args:

x: array containing x values to be histogrammed
y: array containing y values to be histogrammed
bins: number of bins
xlabel: label for x-axis
ylabel: label for y axix
clabel: label for colour index

Returns:

float array: array with counts per bin
float array: histogram edges in x
float array: histogram edges in y

PhyPraKit.odFit(fitf, x, y, sx, sy, p0=None)¶

fit an arbitrary function with errors on x and y uses numerical “orthogonal distance regression” from package scipy.odr

Args:

fitf: function to fit, arguments (array:P, float:x)

x: np-array, independent data

y: np-array, dependent data

sx: np-array, uncertainty on x

sy: np-array, uncertainty on y

p0: none, scalar or array, initial guess of parameters

Returns:

np-array of float: parameter values
np-array of float: parameter errors
np-array: cor correlation matrix
float: chi2 chi-square

PhyPraKit.offsetFilter(a)¶: correct an offset in array a (assuming a symmetric signal around zero) by subtracting the mean

PhyPraKit.profile2d(H2d, xed, yed)¶

generate a profile plot from 2d histogram:

mean y at a centre of x-bins, standard deviations as error bars

Args:

H2d: histogram array (as returned by histogram2d)
xed: bin edges in x
yed: bin edges in y

Returns:

float: array of bin centres in x
float: array mean
float: array rms
float: array sigma on mean

PhyPraKit.readCassy(file, prlevel=0)¶

read Data exported from Cassy in .txt format

Args:

file: string, file name
prlevel: printout level, 0 means silent

Returns:

units: list of strings, channel units
data: tuple of arrays, channel data

PhyPraKit.readColumnData(fname, ncols=4, cchar='#', delimiter=None, pr=True)¶

read column-data from file

input is assumed to be columns of floats
characters following <cchar>, and <cchar> itself, are ignored
words with preceeding ‘*’ are taken as keywords for meta-data, text following the keyword is returned in a dictionary

Args:

string fnam: file name
int ncols: number of columns
char delimiter: character separating columns
bool pr: print input to std out if True

PhyPraKit.readPicoScope(file, prlevel=0)¶

read Data exported from PicoScope in .txt or .csv format

Args:

file: string, file name
prlevel: printout level, 0 means silent

Returns:

units: list of strings, channel units
data: tuple of arrays, channel data

PhyPraKit.resample(a, t=None, n=11)¶

perform average over n data points of array a, return reduced array, eventually with corresponding time values

method:

value at index i and int(width/2) neighbours are averaged to from the new value at index i

Args:

a, t: np-arrays of values of same length
width: int, number of values of array a to average over (if width is an even number, width+1 is used)

Returns:

av: array with reduced number of samples
tav: a second, related array with reduced number of samples

PhyPraKit.simplePeakfinder(x, a, th=0.0)¶

find positions of all Peaks and Dips in data: x-coordinates are determined from weighted average over 3 data points

this only works for very smooth data with well defined extrema use convolutionPeakfinder instead

Args:

x: np-array of values

a: np-array of values at postion x

th: float, threshold for peaks

Returns:

np-array: x position of peak as weighted mean over neighbours

np-array: y posiion

PhyPraKit.smearData(d, s, srel=None, abscor=None, relcor=None)¶

Generate measurement data from “true” input d by

adding random deviations according to the uncertainties

Args:

d: np-array, (true) input data

the following are single floats or arrays of length of array d

s: gaussian uncertainty(ies) (absolute)
srel: gaussian uncertainties (relative)

the following are common (correlated) systematic uncertainties

abscor: absolute, correlated uncertainty
relcor: relative, correlated uncertainty

Returns:

np-array of floats: dm, smeared (=measured) data

PhyPraKit.wmean(x, sx, pr=True)¶

weighted mean of np-array x with uncertainties sx

Args:

x: np-array of values
sx: uncertainties
pr: if True, print result

Returns:

float: mean, sigma

test_readColumnData.py

test data input from text file with module PhyPraKit.readColumnData

test_readPicoSocpe.py

Einlesen von Daten aus einer mit PicoScope erstellten Datei

cassylabxParser.py

read files in xml-format produced with the Leybold Cassy system

uses PhyPraPit.labxParser()

test_Historgram.py

test histogram functionality in PhyPraKit

test_kRegression: test linear regression with kafe using kFit from PhyPrakKit uncertainties in x and y and correlated absolute and relative uncertainties

test_kFit: test fiting an arbitrary fucntion with kafe, with uncertainties in x and y and correlated absolute and relative uncertainties

test_generateDate

test generation of simulated data this simulates a measurement with given x-values with uncertainties; random deviations are then added to arrive at the true values, from which the true y-values are then calculated according to a model function. In the last step, these true y-values are smeared by adding random deviations to obtain a sample of measured values

kfitf.py

Perform a fit with the kafe package driven by input file

usage: kfitf.py [-h] [-n] [-s] [-c] [–noinfo] [-f FORMAT] filename

positional arguments:

filename name of fit input file

optional arguments:

`-h, --help`	show this help message and exit
`-n, --noplot`	suppress ouput of plots on screen
`-s, --saveplot`	save plot(s) in file(s)
`-c, --contour`	plot contours and profiles
`--noinfo`	suppress fit info on plot
`--noband`	suppress 1-sigma band around function
`--format FMT`	graphics output format, default FMT = pdf

Beispliel_Drehpendel.py

Auswertung der Daten aus einer im CASSY labx-Format gespeicherten Datei am Beispiel des Drehpendels

Einlesen der Daten im .labx-Format

Säubern der Daten durch verschiedene Filterfunktionen: - offset-Korrektur - Glättung durch gleitenden Mittelwert - Zusammenfassung benachberter Daten durch Mittelung

Fourier-Transformation (einfach und fft)

Suche nach Extrema (peaks und dips)

Anpassung von Funkionen an Einhüllende der Maxima und Minima

Interpolation durch Spline-Funktionen

numerische Ableitung und Ableitung der Splines

Phasenraum-Darstellung (aufgezeichnete Wellenfunktion gegen deren Ableitung nach der Zeit)

Beispiel_Hysterese.py

Auswertung der Daten aus einer mit PicoScope erstellten Datei im txt-Format am Beispiel des Hystereseversuchs

Einlesen der Daten aus PicoScope-Datei vom Typ .txt

Darstellung Kanal_a vs. Kanal_b

Auftrennung in zwei Zweige für steigenden bzw. abnehmenden Strom

Interpolation durch kubische Splines

Integration der Spline-Funktionen

test_readPicoSocpe.py

Einlesen von Daten aus dem mit PicoScope erstellten Dateien am Beispiel der akustischen Anregung eines Stabes

Dokumentation von PhyPraKit¶

Indices and tables¶

Darstellung und Auswertung von Messdaten¶

Dokumentation der Beispiele¶

Modul-Dokumentation¶

Table Of Contents

This Page

Navigation

Dokumentation von PhyPraKit¶

Indices and tables¶

Darstellung und Auswertung von Messdaten¶

Dokumentation der Beispiele¶

Modul-Dokumentation¶

Table Of Contents

This Page

Quick search

Navigation