dask.dataframe.tseries.resample.Resampler.sem

dask.dataframe.tseries.resample.Resampler.sem¶

Resampler.sem()[源代码]¶

计算各组均值的标准误差，排除缺失值。

此文档字符串是从 pandas.core.resample.Resampler.sem 复制而来的。

Dask 版本可能存在一些不一致性。

对于多个分组，结果索引将是 MultiIndex。

参数

ddofint, 默认值为 1 （Dask 中不支持）: 自由度。
仅数值bool, 默认 False (Dask 中不支持): 只包含 float、int 或 boolean 数据。

1.5.0 新版功能.

在 2.0.0 版更改: numeric_only 现在默认设置为 False。

返回

Series 或 DataFrame: 每个组内值的平均标准误差。

示例

对于 SeriesGroupBy:

>>> lst = ['a', 'a', 'b', 'b']  
>>> ser = pd.Series([5, 10, 8, 14], index=lst)  
>>> ser  
a     5
a    10
b     8
b    14
dtype: int64
>>> ser.groupby(level=0).sem()  
a    2.5
b    3.0
dtype: float64

对于 DataFrameGroupBy：

>>> data = [[1, 12, 11], [1, 15, 2], [2, 5, 8], [2, 6, 12]]  
>>> df = pd.DataFrame(data, columns=["a", "b", "c"],  
...                   index=["tuna", "salmon", "catfish", "goldfish"])
>>> df  
           a   b   c
    tuna   1  12  11
  salmon   1  15   2
 catfish   2   5   8
goldfish   2   6  12
>>> df.groupby("a").sem()  
      b  c
a
1    1.5  4.5
2    0.5  2.0

对于重采样器：

>>> ser = pd.Series([1, 3, 2, 4, 3, 8],  
...                 index=pd.DatetimeIndex(['2023-01-01',
...                                         '2023-01-10',
...                                         '2023-01-15',
...                                         '2023-02-01',
...                                         '2023-02-10',
...                                         '2023-02-15']))
>>> ser.resample('MS').sem()  
2023-01-01    0.577350
2023-02-01    1.527525
Freq: MS, dtype: float64

dask.dataframe.tseries.resample.Resampler.quantile

dask.dataframe.tseries.resample.Resampler.size