statistics – Page 5 – Tarik Billa

What is log-likelihood? [closed]

August 25, 2023 by Tarik

The only reason to use the log-likelihood instead of the plain old likelihood is mathematical convenience, because it lets you turn multiplication into addition. The plain old likelihood is P(parameters | data), i.e. assuming your data is fixed and you vary the parameters of your model. Maximizing this is one way to do parameter estimation … Read more

scipy, lognormal distribution – parameters

August 23, 2023 by Tarik

The distributions in scipy are coded in a generic way wrt two parameter location and scale so that location is the parameter (loc) which shifts the distribution to the left or right, while scale is the parameter which compresses or stretches the distribution. For the two parameter lognormal distribution, the “mean” and “std dev” correspond … Read more

3D Least Squares Plane

August 21, 2023 by Tarik

If you have n data points (x[i], y[i], z[i]), compute the 3×3 symmetric matrix A whose entries are: sum_i x[i]*x[i], sum_i x[i]*y[i], sum_i x[i] sum_i x[i]*y[i], sum_i y[i]*y[i], sum_i y[i] sum_i x[i], sum_i y[i], n Also compute the 3 element vector b: {sum_i x[i]*z[i], sum_i y[i]*z[i], sum_i z[i]} Then solve Ax = b for the … Read more

Linear Regression in Javascript [closed]

August 21, 2023 by Tarik

What kind of linear regression? For something simple like least squares, I’d just program it myself: http://mathworld.wolfram.com/LeastSquaresFitting.html The math is not too hard to follow there, give it a shot for an hour or so and let me know if it’s too hard, I can try it. EDIT: Found someone that did it: http://dracoblue.net/dev/linear-least-squares-in-javascript/159/

Sample from multivariate normal/Gaussian distribution in C++

August 17, 2023 by Tarik

Since this question has garnered a lot of views, I thought I’d post code for the final answer that I found, in part, by posting to the Eigen forums. The code uses Boost for the univariate normal and Eigen for matrix handling. It feels rather unorthodox, since it involves using the “internal” namespace, but it … Read more

Is it possible to get statistics with TortoiseSVN?

August 15, 2023 by Tarik

You can get basic statistics by using “Show Log…” and then “Statistics” (Button at the bottom IIRC)

How to force zero interception in linear regression?

August 15, 2023 by Tarik

As @AbhranilDas mentioned, just use a linear method. There’s no need for a non-linear solver like scipy.optimize.lstsq. Typically, you’d use numpy.polyfit to fit a line to your data, but in this case you’ll need to do use numpy.linalg.lstsq directly, as you want to set the intercept to zero. As a quick example: import numpy as … Read more

Random document in ElasticSearch

August 13, 2023 by Tarik

I know it is an old question, but now it is possible to use random_score, with the following search query: { “size”: 1, “query”: { “function_score”: { “functions”: [ { “random_score”: { “seed”: “1477072619038” } } ] } } } For me it is very fast with about 2 million documents. I use current timestamp … Read more