Implementing a Word Cloud in Python

One of the cooler things about Python is how easily it lets you generate a word cloud.

The open-source code for this project is on GitHub:

https://github.com/amueller/word_cloud

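Note: before running the examples from a clone of the repository, delete the wordcloud folder inside it. The likely reason is that this local source folder can otherwise be imported in place of the installed package.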

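Part of what a word cloud does is NLP-based (splitting the text into words, dropping stopwords, counting how often each word occurs), and part is image-based (laying the words out inside a shape and rendering them).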
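To make the text-processing half concrete, here is a minimal sketch of the counting step using only the standard library. It is not wordcloud's actual implementation: the function name `word_frequencies`, the tiny `DEMO_STOPWORDS` set, and the sample sentence are all assumptions made up for illustration.

```python
import re
from collections import Counter

# Hypothetical, tiny stopword list for illustration only; the wordcloud
# package ships its own, much larger STOPWORDS set.
DEMO_STOPWORDS = {"the", "a", "and", "to", "of", "said"}


def word_frequencies(text, stopwords=DEMO_STOPWORDS):
    """Count how often each non-stopword occurs in text (case-insensitive)."""
    # Lowercase the text and extract runs of letters/apostrophes as words.
    words = re.findall(r"[a-z']+", text.lower())
    return Counter(w for w in words if w not in stopwords)


sample = "Alice said the cat and the rabbit said hello to Alice"
freqs = word_frequencies(sample)
print(freqs.most_common(1))
```

The more often a word appears in such a count, the larger it is drawn in the finished cloud; stopwords such as "said" never reach the drawing stage.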

The walkthrough below uses one of the examples from the wordcloud GitHub repository:

from os import path

from PIL import Image
import numpy as np
import matplotlib.pyplot as plt

from wordcloud import WordCloud, STOPWORDS

d = path.dirname(__file__)

# Read the whole text.
text = open(path.join(d, 'alice.txt')).read()

# read the mask image
# taken from
# http://www.stencilry.org/stencils/movies/alice%20in%20wonderland/255fk.jpg
alice_mask = np.array(Image.open(path.join(d, "alice_mask.png")))

stopwords = set(STOPWORDS)
stopwords.add("said")

wc = WordCloud(background_color="white", max_words=2000, mask=alice_mask,
               stopwords=stopwords)

# generate word cloud
wc.generate(text)

# store to file
wc.to_file(path.join(d, "alice.png"))

# show
plt.imshow(wc, interpolation='bilinear')
plt.axis("off")
plt.figure()
plt.imshow(alice_mask, cmap=plt.cm.gray, interpolation='bilinear')
plt.axis("off")
plt.show()
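The mask argument is what gives the cloud its shape: according to the wordcloud documentation, pure white entries in the mask are treated as masked out, and words may be drawn everywhere else. The sketch below illustrates that rule on a hand-made grid using plain Python lists. The names `demo_mask` and `drawable_cells` are hypothetical and are not part of the library, which works on a NumPy array such as alice_mask.

```python
# Illustrative grayscale mask as nested lists (0 = black, 255 = white).
# In the real example the mask is a NumPy array loaded from alice_mask.png.
demo_mask = [
    [255, 255, 0, 255],
    [255, 0, 0, 255],
    [0, 0, 0, 0],
]


def drawable_cells(mask):
    """Return (row, col) positions that are not pure white, i.e. free to draw on."""
    return [
        (r, c)
        for r, row in enumerate(mask)
        for c, value in enumerate(row)
        if value != 255
    ]


cells = drawable_cells(demo_mask)
total = sum(len(row) for row in demo_mask)
print(len(cells), "of", total, "cells are drawable")
```

In practice this means a mask image should have a white background and a non-white silhouette, as the Alice stencil does.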

This article is a reader submission and does not represent my own views. If you repost it, please credit the source: http://www.sosokankan.com/article/1951240.html
