python扫描本地音乐并下载歌词的例子_python网络编程

当前位置：首页 > 脚本编程 > python > python网络编程 > 正文

python扫描本地音乐并下载歌词的例子: 发布时间：2019-12-09编辑：脚本学堂

本文介绍了python扫描本地音乐并下载歌词的方法，使用百度音乐搜索api实现本地音乐歌词搜索，有需要的朋友参考下。

百度提供了一个音乐搜索的api，百度请求类似于:
http://box.zhangmen.baidu.com/x?op=12&count=1&title=最佳损友$$陈奕迅$$$$的地址，百度会给你返回一段xml，如下:
This XML file does not appear to have any style information associated with it. The document tree is shown below.

复制代码代码示例:

<result>

<count>1</count>

<url>

<encode>

<![CDATA[

http://zhangmenshiting.baidu.com/data2/music/12762845/YmRqamdua21fn6NndK6ap5WXcJlrmG1xlJhobWibmGpjk5ZtmWiZcWRjZ5lqbGyelGKWlZtubGljZ5lka2uanWSXY1qin5t1YWBmZW

5ocGlhaWdnbGtqbzE$

]]>

</encode>

<decode>

<![CDATA[

12762845.mp3?xcode=e6b69cf593ea22ac9d2b9314e565fc0caf85125f065ce3e0&mid=0.31929107437537

]]>

</decode>

<type>8</type>

<lrcid>2829</lrcid>

<flag>1</flag>

</url>

<durl>

<encode>

<![CDATA[

http://zhangmenshiting2.baidu.com/data2/music/7345405/aGVnaWlmbGaeomZzrZmmnJZvmGqXbHCbl2dsZ5qXaWqSlWpsmmdrb2mXamxpbXCclGNsmW2ba25mYmxtapmZcWqTW

aGemnRoX2VkbWdvaGhoZmZramluOA$$

]]>

</encode>

<decode>

<![CDATA[

7345405.mp3?xcode=e6b69cf593ea22ac78e1478e78479dc19e8e4650995cb99a&mid=0.31929107437537

]]>

</decode>

<type>8</type>

<lrcid>2829</lrcid>

<flag>1</flag>

</durl>

<p2p>

<hash>f98b6772aa97966550ec80617879becee0233bf4</hash>

<url>

<![CDATA[ ]]>

</url>

<type>mp3</type>

<size>3778335</size>

<bitrate>128</bitrate>

</p2p>

</result>

由于只是获取到歌曲的lrc歌词地址，所以有用的只有2829这个标签。
而encode和decode里面的拼接起来就是mp3的下载地址，如本例的
http://zhangmenshiting.baidu.com/data2/music/12762845/YmRqamdua21fn6NndK6ap5WXcJlrmG1xlJhobWibmGpjk5ZtmWiZcWRjZ5lqbGyelGKWlZtubGljZ5lka2uanWSXY1qin5t1YWBm
ZW5ocGlhaWdnbGtqbzE$12762845.mp3?xcode=e6b69cf593ea22ac9d2b9314e565fc0caf85125f065ce3e0&mid=0.31929107437537

就是下载地址，不过音质太差，有时间在研究下这个。
继续说歌词，注意lrcid标签里面的2829。（脚本学堂编辑整理 www.jb200.com）
http://box.zhangmen.baidu.com/bdlrc/ 这个是百度lrc歌词存放地址,
然后本例的歌词地址是http://box.zhangmen.baidu.com/bdlrc/28/2829.lrc
看到了吧，歌词地址后面的两个数字的计算方法是在lrcid除以100所获得的整数，就是第一个数字，然后第二个数字就是lrcid，然后后面加上后缀.lrc就搞定了
获得lrc地址之后就简单了，只要请求该地址，然后将获取到的内容写入文件就ok了。
例子：

复制代码代码示例:

import os

import os.path

import re

import eyed3

import urllib2

import urllib

from urllib import urlencode

import sys

import os

reload(sys)

sys.setdefaultencoding('utf8')

music_path = r"E:music"

lrc_path = r"e:lrc"

os.remove('nolrc.txt')

os.remove('lrcxml.txt')

the_file = open('lrcxml.txt','a')

nolrc_file = open('nolrc.txt','a')

for root,dirs,files in os.walk(music_path):

    for filepath in files:

        the_path = os.path.join(root,filepath)

        if (the_path.find("mp3") != -1):

            print the_path

            the_music = eyed3.load(the_path)

            the_teg = the_music.tag._getAlbum()

            the_artist = the_music.tag._getArtist()

            the_title = the_music.tag._getTitle()

           # print the_teg

           # print the_title

           # print the_artist

            b = the_title.replace(' ','+')

           # print b

            a = the_artist.replace(' ','+')

            #print urlencode(str(b))

            if isinstance(a,unicode):

                a = a.encode('utf8')

            song_url = "http://box.zhangmen.baidu.com/x?op=12&count=1&title="+b+"$$"+a+"$$$$ "

            the_file.write(song_url+'n')

            page = urllib2.urlopen(song_url).read()

            print page

            theid = 0

            lrcid =  re.compile('<lrcid>(.*?)</lrcid>',re.S).findall(page)

            have_lrc = True

            if lrcid != []:

                theid = lrcid[0]

            else:

                nolrc_file.write(the_title+'n')

                have_lrc = False

            print theid

            if have_lrc:

                firstid = int(theid)/100

                lrcurl = "http://box.zhangmen.baidu.com/bdlrc/"+str(firstid)+"/"+theid+".lrc"

                print lrcurl

                lrc = urllib2.urlopen(lrcurl).read()

                if(lrc.find('html')== -1):

                    lrcfile = open(lrc_path+""+the_title+".lrc",'w')

                    lrcfile.writelines(lrc)

                    lrcfile.close()

                else:

                    nolrc_file.write(the_title+'n')

the_file.close()

nolrc_file.close()

print "end!"

有用第一步请求所获取到底是xml格式的，所以本来想着解析xml来获取lrcid，但是在实现过程中遇到了各种问题，别的还容易，就在这一块儿浪费的时间最长，纠结未果之后，只能改用正则表达式来获取了。只能说明还是学艺不精呢。

推荐阅读：

python实例之定时播放(闹钟+音乐)

以上就是python扫描本地音乐并下载歌词的实例教程，希望对大家有所帮助。

上一篇：python监控电脑实例教程(2)
下一篇：Python库urllib与urllib2主要区别

与 python扫描本地音乐并下载歌词的例子有关的文章

本文标题：python扫描本地音乐并下载歌词的例子
本页链接：http://www.jb200.com/article/python/23043.html

浏览排行

栏目分类

热点文章

python扫描本地音乐并下载歌词的例子