pandas.Index.drop_duplicates

Index.drop_duplicates(*, keep='first')

Return Index with duplicate values removed.

Parameters:

keep : {‘first’, ‘last’, False}, default ‘first’

Returns:

See also

Examples

Generate a pandas.Index with duplicate values.

>>> idx = pd.Index(["llama", "cow", "llama", "beetle", "llama", "hippo"])

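The keep parameter controls which duplicate values are removed. The value ‘first’ keeps the first occurrence in each set of duplicated entries. The default value of keep is ‘first’.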

>>> idx.drop_duplicates(keep="first")
Index(['llama', 'cow', 'beetle', 'hippo'], dtype='object')
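
As a small sketch of the default behaviour, reusing the same idx as above: calling drop_duplicates() with no arguments should match keep="first", the result should contain only unique labels, and the original index should be left unchanged because a new Index is returned. The variable names below are illustrative only.

```python
import pandas as pd

idx = pd.Index(["llama", "cow", "llama", "beetle", "llama", "hippo"])

# No argument: relies on the documented default, keep="first".
default_result = idx.drop_duplicates()
explicit_result = idx.drop_duplicates(keep="first")

# Compare the two results element by element.
same_as_first = default_result.equals(explicit_result)
print(same_as_first)

# After dropping duplicates, every label appears once.
print(default_result.is_unique)

# The original index keeps all six entries; the result has four.
print(len(idx), len(default_result))
```

Checking is_unique on the result is a convenient sanity check before using the index for label-based lookups.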

The value ‘last’ keeps the last occurrence in each set of duplicated entries.

>>> idx.drop_duplicates(keep="last")
Index(['cow', 'beetle', 'llama', 'hippo'], dtype='object')

The value False discards every entry that belongs to a set of duplicates.

>>> idx.drop_duplicates(keep=False)
Index(['cow', 'beetle', 'hippo'], dtype='object')
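
One way to reason about the three keep options is through Index.duplicated, which returns a boolean mask marking the entries considered duplicates under the same keep rule. The sketch below, in which drop_via_mask is a hypothetical helper and not part of pandas, checks that filtering with the inverted mask gives the same result as drop_duplicates for the example index.

```python
import pandas as pd

idx = pd.Index(["llama", "cow", "llama", "beetle", "llama", "hippo"])


def drop_via_mask(index, keep="first"):
    """Hypothetical helper: keep only entries not flagged by Index.duplicated."""
    # duplicated() returns a boolean array; True marks entries to drop.
    mask = index.duplicated(keep=keep)
    return index[~mask]


# Compare the mask-based result with drop_duplicates for each keep option.
results = {}
for keep in ("first", "last", False):
    via_mask = drop_via_mask(idx, keep=keep)
    results[keep] = via_mask.equals(idx.drop_duplicates(keep=keep))

print(results)
```

The mask form is useful when the same positions also need to be removed from a parallel array or Series, since the boolean mask can be reused there.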