Need a simple example of calculating RMSE with Pandas DataFrame. Providing there is function that returns in cycle true and predicted value:
def fun (data): ... return trueVal, predVal for data in set: fun(data)
And then some code puts these results in the following data frame where
x
is a real value andp
is a predicted value:In [20]: d Out[20]: {'p': [1, 10, 4, 5, 5], 'x': [1, 2, 3, 4, 5]} In [21]: df = pd.DataFrame(d) In [22]: df Out[22]: p x 0 1 1 1 10 2 2 4 3 3 5 4 4 5 5
Questions:
1) How to put results from
fun
function indf
data frame?2) How to calculate RMSE using
df
data frame?
Answer
Question 1
This depends on the format that data is in. And I’d expect you already have your true values, so this function is just a pass through.
Question 2
With pandas
((df.p - df.x) ** 2).mean() ** .5
With numpy
(np.diff(df.values) ** 2).mean() ** .5
Attribution
Source : Link , Question Author : zork , Answer Author : piRSquared