书生大模型实战营-L0-Python基础知识

Sep 30, 2024

Category 课外学习 Tags

#大模型

本节任务要点#

Python实现wordcount
Vscode连接InternStudio debug笔记

实践流程#

实现wordcount#

请实现一个wordcount函数，统计英文字符串中每个单词出现的次数。返回一个字典，key为单词，value为对应单词出现的次数。

Input:

1
"""Hello world!
2
This is an example.
3
Word count is fun.
4
Is it fun to count words?
5
Yes, it is fun!"""

Output:

1
{'hello': 1, 'world': 1, 'this': 1, 'is': 4, 'an': 1, 'example': 1, 'word': 1, 'count': 2,
2
'fun': 3, 'it': 2, 'to': 1, 'words': 1, 'yes': 1}

TIPS：记得先去掉标点符号,然后把每个单词转换成小写。不需要考虑特别多的标点符号，只需要考虑实例输入中存在的就可以。

1
text = """
2
Got this panda plush toy for my daughter's birthday,
3
who loves it and takes it everywhere. It's soft and
4
super cute, and its face has a friendly look. It's
5
a bit small for what I paid though. I think there
6
might be other options that are bigger for the
7
same price. It arrived a day earlier than expected,
8
so I got to play with it myself before I gave it
9
to her.
10
"""
11

12
def wordcount(text):
13
    pass

直接交给kimi

实际代码

1
text = """Hello world!
2
This is an example.
3
Word count is fun.
4
Is it fun to count words?
5
Yes, it is fun!"""
6

7
text1 = """
8
Got this panda plush toy for my daughter's birthday,
9
who loves it and takes it everywhere. It's soft and
10
super cute, and its face has a friendly look. It's
11
a bit small for what I paid though. I think there
12
might be other options that are bigger for the
13
same price. It arrived a day earlier than expected,
14
so I got to play with it myself before I gave it
15
to her.
16
"""
17

18
import re
19

20
def wordcount(text):
21
    # 将文本转换为小写
22
    text = text.lower()
23
    # 使用正则表达式移除标点符号
24
    text = re.sub(r'[^\w\s]', '', text)
25
    # 分割文本为单词列表
26
    words = text.split()
27
    # 创建一个字典来存储单词计数
28
    word_count = {}
29
    # 遍历单词列表，计数
30
    for word in words:
31
        if word in word_count:
32
            word_count[word] += 1
33
        else:
34
            word_count[word] = 1
35
    return word_count
36

37
print(wordcount(text))
38
print(wordcount(text1))

测试结果

连接InternStudio debug笔记#

请使用本地vscode连接远程开发机，将上面你写的wordcount函数在开发机上进行debug，体验debug的全流程，并完成一份debug笔记(需要截图)。

在32行打点，表示第一次达到这个点的时候，应该是有一个单词已经出现过一次，才会进入这一行，查看text文本，这里进入的word是is，is正好出现第二次

选择单步运行，发现word_count里面的is对应的数值已经是2了，增加了1

取消32行断点，设置35行断点，直接运行到该断点，左侧统计出了该text文本段所有词汇的出现次数。

总结#

温故知新，debug很好用，不要再总是一次次print了。ai编程也是个好东西，提高生产力。

Author Junyao Hu

Published Sep 30, 2024

Link https://junyaohu.github.io/blog/internlm-l0-python/

本节任务要点#

实践流程#

实现wordcount#

连接InternStudio debug笔记#

总结#

Comments