[Python Knowledge Base] Deduplication in Python


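Deduplicating strings and lists

A set can deduplicate a string or a list: a set is an unordered collection of unique elements, so passing the data through set() drops the duplicates.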

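# Deduplicate the words in a sentence
def words_dumplate(sentence):
    return ' '.join(set(sentence.split()))

print(words_dumplate("Python is great and Java is also great"))

# Deduplicate the characters in a string
def str_dumplate(string):
    # A string is already iterable, so set() can consume it directly
    return ''.join(set(string))

# Deduplicate a list
lst = ["1", 2, 4, 3, 2, 4]
print(list(set(lst)))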

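Output (set iteration order is arbitrary, so it can differ between runs; the second line comes from a str_dumplate call whose input is not shown in the listing):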

Python Java also great is and
wrcegst
[3, '1', 2, 4]

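Using set makes it easy to remove duplicates from a string or a list, but it has one drawback: the order of the elements in the resulting string or list is not preserved.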
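
As a small illustration of that drawback, the sketch below checks that the set keeps the right elements whatever order it iterates in, and shows that for a list the original order can be recovered by sorting on each element's first index. The sorted(..., key=lst.index) step is an addition for illustration, not part of the original code, and it takes quadratic time in the length of the list.

```python
lst = ["1", 2, 4, 3, 2, 4]

unique = set(lst)
# Same elements regardless of the order the set iterates in
print(unique == {"1", 2, 3, 4})  # True

# Recover first-occurrence order by sorting on each element's first index
restored = sorted(unique, key=lst.index)
print(restored)  # ['1', 2, 4, 3]
```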

Removing duplicates while preserving order

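def order_dumplate(lst):
    # Collect the elements in first-seen order
    new_lst = []
    for item in lst:
        # A list membership test is O(n), but it also works for unhashable items
        if item not in new_lst:
            new_lst.append(item)
    return new_lst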


lst=["1",2,3,2,3,4,5,4]

print(order_dumplate(lst))

str1="e4r442ee44rrr"
print(order_dumplate(list(str1)))
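
For hashable elements, a shorter alternative is dict.fromkeys. Dictionaries preserve insertion order in Python 3.7 and later, so building a dict from the items and reading back its keys drops duplicates while keeping first-seen order. This is a sketch of a different technique from the loop above, and the helper name dedupe_hashable is made up for illustration.

```python
def dedupe_hashable(items):
    # dict keys are unique and keep insertion order (Python 3.7+)
    return list(dict.fromkeys(items))

lst = ["1", 2, 3, 2, 3, 4, 5, 4]
print(dedupe_hashable(lst))  # ['1', 2, 3, 4, 5]

str1 = "e4r442ee44rrr"
print(''.join(dedupe_hashable(str1)))  # e4r2
```

Unlike the loop above, this version raises TypeError if any element is unhashable (a list or a dict, for example).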

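order_dumplate can be taken a step further by turning it into a generator, which yields each unique item as soon as it is found instead of building the whole result list first: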

def dedupe(items):
    # A set gives O(1) membership tests; the items must be hashable
    seen = set()
    for item in items:
        if item not in seen:
            yield item
            seen.add(item)


lst=["1",2,3,2,3,4,5,4]

print(list(dedupe(lst)))
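
One practical benefit of the generator form is that it is lazy, so it can consume a stream without materializing it first. The sketch below assumes hashable items (it tracks them in a set) and uses a hypothetical endless input built with itertools.count; itertools.islice stops after the first five unique values.

```python
from itertools import count, islice

def dedupe(items):
    seen = set()  # assumes every item is hashable
    for item in items:
        if item not in seen:
            yield item
            seen.add(item)

# Hypothetical endless stream that cycles through 0..4
stream = (n % 5 for n in count())

# Only as many items as needed are pulled from the stream
first_five = list(islice(dedupe(stream), 5))
print(first_five)  # [0, 1, 2, 3, 4]
```

Asking for a sixth unique value here would never return, because the stream only ever produces five distinct values.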

To deduplicate a list of dictionaries, pass a function that converts each element of the sequence into a hashable value:

def dedupe(items, key=None):
    # seen holds the hashable values already produced by key (or the items themselves)
    seen = set()
    for item in items:
        val = key(item) if key else item
        if val not in seen:
            yield item
            seen.add(val)

lst=["1",2,3,2,3,4,5,4]
print(list(dedupe(lst)))

lst1=[{"x":1,"y":2},{"x":2,"y":"3"},{"x":1,"y":3}]
lst2=[{"x":1,"y":2},{"x":2,"y":"3"},{"x":1,"y":3},{"x":1,"y":2}]

print(list(dedupe(lst1,key=lambda x:x["x"])))
print(list(dedupe(lst2,key=lambda x:(x["x"],x["y"]))))

Output:

['1', 2, 3, 4, 5]
[{'x': 1, 'y': 2}, {'x': 2, 'y': '3'}]
[{'x': 1, 'y': 2}, {'x': 2, 'y': '3'}, {'x': 1, 'y': 3}]

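Here key is a callback function. Dictionaries are not hashable, so the caller has to specify what makes two dictionaries count as the same: key returns the value that is compared, such as a single field or a tuple of fields.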
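
When two dictionaries should count as equal only if all of their entries match, the key can be built from the whole dictionary rather than from hand-picked fields. The sketch below repeats a set-based dedupe so that it runs on its own. Using frozenset(d.items()) as the key is an assumption for illustration, and it only works when every value in the dictionaries is hashable.

```python
def dedupe(items, key=None):
    seen = set()
    for item in items:
        val = key(item) if key else item
        if val not in seen:
            yield item
            seen.add(val)

lst2 = [{"x": 1, "y": 2}, {"x": 2, "y": "3"}, {"x": 1, "y": 3}, {"x": 1, "y": 2}]

# A frozenset of (key, value) pairs is hashable and ignores key order
unique = list(dedupe(lst2, key=lambda d: frozenset(d.items())))
print(unique)
# [{'x': 1, 'y': 2}, {'x': 2, 'y': '3'}, {'x': 1, 'y': 3}]
```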

Added: 2022-04-27 11:17:51 Updated: 2022-04-27 11:18:03
