Fitting data with python: Difference between revisions
| m (Created page with "The basic syntax is the following: <source lang="py"> #!/usr/bin/python  from scipy import optimize from numpy import array  # your data as lists x = [0.0, 1.0, 2.0,  3.0] y = [1...") | |||
| (2 intermediate revisions by the same user not shown) | |||
| Line 1: | Line 1: | ||
| == Curve fitting == | |||
| Preparing noisy data: | |||
| <source lang="py"> | |||
| Npoints = 30 | |||
| x = np.linspace(1,10,100) | |||
| xb = np.linspace(1,10,Npoints) | |||
| f = lambda x: np.sin(x) | |||
| yb = f(xb) + 0.3*np.random.normal(size=len(xb)) | |||
| </source> | |||
| Using a polynomial fit that is based on generalized linear regression algorithm, solving a linear system. | |||
| <source lang="py"> | |||
| from numpy.polynomial import polynomial as P | |||
| coeff, stats = P.polyfit(xb,yb,9,full=True) | |||
| fitpoly = P.Polynomial(coeff) | |||
| print stats | |||
| </source> | |||
| fitpoly is a function and coeff are the coefficients of the optimal polynomial. | |||
| Using curve-fit that calls *leastsq* algorithm, taking a step-by-step search for the minimum. | |||
| <source lang="py"> | |||
| fitfunc = lambda x, a, b: a*np.sin(b*x) | |||
| p, pcov = curve_fit(fitfunc,xb,yb,p0 = [1.0,1.0]) | |||
| print p, np.sqrt(np.diag(pcov)) | |||
| </source> | |||
| The last lines provides the found optimal parameters and their uncertainties.  | |||
| It is worth trying several guesses p0. | |||
| Plotting the results: | |||
| <source lang="py"> | |||
| import matplotlib.pyplot as plt | |||
| plt.scatter(xb,yb) | |||
| plt.plot(x,f(x)) | |||
| plt.plot(x,fitpoly(x)) | |||
| plt.plot(x,fitfunc(x,p[0],p[1])) | |||
| plt.show() | |||
| </source> | |||
| == Using the least-square function directly == | |||
| The basic syntax is the following: | The basic syntax is the following: | ||
| <source lang="py"> | <source lang="py"> | ||
Latest revision as of 15:01, 31 January 2016
Curve fitting
Preparing noisy data: <source lang="py"> Npoints = 30 x = np.linspace(1,10,100) xb = np.linspace(1,10,Npoints) f = lambda x: np.sin(x) yb = f(xb) + 0.3*np.random.normal(size=len(xb)) </source>
Using a polynomial fit that is based on generalized linear regression algorithm, solving a linear system. <source lang="py"> from numpy.polynomial import polynomial as P coeff, stats = P.polyfit(xb,yb,9,full=True) fitpoly = P.Polynomial(coeff) print stats </source> fitpoly is a function and coeff are the coefficients of the optimal polynomial.
Using curve-fit that calls *leastsq* algorithm, taking a step-by-step search for the minimum. <source lang="py"> fitfunc = lambda x, a, b: a*np.sin(b*x) p, pcov = curve_fit(fitfunc,xb,yb,p0 = [1.0,1.0]) print p, np.sqrt(np.diag(pcov)) </source> The last lines provides the found optimal parameters and their uncertainties. It is worth trying several guesses p0.
Plotting the results: <source lang="py"> import matplotlib.pyplot as plt plt.scatter(xb,yb) plt.plot(x,f(x)) plt.plot(x,fitpoly(x)) plt.plot(x,fitfunc(x,p[0],p[1])) plt.show() </source>
Using the least-square function directly
The basic syntax is the following: <source lang="py">
- !/usr/bin/python
from scipy import optimize from numpy import array
- your data as lists
x = [0.0, 1.0, 2.0, 3.0] y = [1.0, 0.5, 0.0, -1.0]
- define a fitting function and the corresponding error function that must be minimized
- use the lambda shortcut or define standard functions with def fit():
- p is the list of parameters
fit = lambda p, x: p[0] + p[1]*(x) + p[2]*(x)**2 err = lambda p, x, y: fit(p,x) - y
- initial guess
p0 = [1.0,1.0,1.0]
- calls optimize.leastsq to find optimal parameters, converts lists into numpy.array on the fly
p, success = optimize.leastsq(err, p0, args=(array(x), array(y)), ftol=5e-9, xtol=5e-9)
- some info about convergence is in success and the optimized parameters in p
</source>