请问这里的数据如何用Python的正则表达式匹配出来?(或者其他更简便方法?)

2025-02-25 05:35:40
推荐回答(2个)
回答1:

# -*- coding:utf-8 -*-
import re
s=u'[{"timestamp":1462590135,"rawXML":"<\/id><\/username><\/createTime>0<\/contentDescShowType>4<\/contentDescScene><\/private><\/contentDesc><\/contentattr><\/sourceUserName><\/sourceNickName><\/statisticsData><\/location><\/contentStyle><![CDATA[婚姻中遇到真爱,要不要离婚?]]><\/title><description><![CDATA[婚姻中遇到真爱应该离婚吗?\n真的有灵魂伴侣吗?\n如何判断一个人适不适合成为自己的伴侣?]]><\/description><contentUrl><!'<br><br>m=re.compile('<username>(.*)<\\\/username>')<br>print re.search(m, s).groups()[0]<br><br>输出>>><br><![CDATA[xxxxxx123]]></pre> <p>注意:\及/为正则表达式中特殊符号,需要转义才可用。</p></p> </div> </div> <div class="clear"></div> </div> <div class="wdhdnr"> <div class="huidanrtop"> <div class="wdhuidaxinx"> <div class="wdhuidaxm">回答2:</div> </div> </div> <div class="clear"></div> <div class="wdhuidanrmid"> <div class="zuijiacont"> <p>目测需要json和xml模块, json把数据转成json格式,然后从其中提取rawXML, 这段用xml模块解析</p> </div> </div> <div class="clear"></div> </div> </div> </div> <div class="wendaright"> <div class="wdluluerwema"> <div class="wdxgwttop">相关问答</div> <div class="wdxgwtnr"> </div> <div class="clear"></div> </div> <!-- 其他随机问答['id'=>alphaID($like['zid'])] --> <div class="wdluluerwema"> <div class="wdxgwttop">最新问答</div> <div class="wdxgwtnr"> <div class="wdxgwtcont"> <div class="wdxgtitle"><a href="https://13l.net/l/1993475285529387067.html">在美国生孩子就是美国国籍了吗?</a></div> </div> <div class="wdxgwtcont"> <div class="wdxgtitle"><a href="https://13l.net/l/363246619.html">2011年12月30日,阴历是12月初6,下午3:25出生的男孩叫什么名字好,姓李,请按五行帮忙算下,谢谢</a></div> </div> <div class="wdxgwtcont"> <div class="wdxgtitle"><a href="https://13l.net/l/474819808.html">有没有什么一女多男或者是一女多男的韩剧,男的一定要帅,女的一定要漂亮、、好的加悬赏</a></div> </div> <div class="wdxgwtcont"> <div class="wdxgtitle"><a href="https://13l.net/l/579240383.html">以前学过电子琴过了8级 很久没碰 如何捡起来??</a></div> </div> <div class="wdxgwtcont"> <div class="wdxgtitle"><a href="https://13l.net/l/296987028.html">设矩阵A是对称正定矩阵,则用__迭代法解线性方程组AX=b其迭代解数列一定收敛</a></div> </div> <div class="wdxgwtcont"> <div class="wdxgtitle"><a href="https://13l.net/l/1835441796817234100.html">去油止脱洗发水</a></div> </div> <div class="wdxgwtcont"> <div class="wdxgtitle"><a href="https://13l.net/l/1373014989036534139.html">白羊女和天秤男合适吗?</a></div> </div> <div class="wdxgwtcont"> <div class="wdxgtitle"><a href="https://13l.net/l/1818517462635429948.html">电脑开机滴一声后无法启动!不知什么原因?</a></div> </div> <div class="wdxgwtcont"> <div class="wdxgtitle"><a href="https://13l.net/l/334922384.html">我12岁110斤身高156cm我没时间做运动怎么减肥阿求各位大侠</a></div> </div> <div class="wdxgwtcont"> <div class="wdxgtitle"><a href="https://13l.net/l/373515448.html">斗罗大陆275章小舞对唐三说的第一句话是什么</a></div> </div> </div> </div> </div> <div class="clear"></div> <div class="footer"> <!-- 移动底部导航 --> <div class="fanhuitop"><a href="#top" ref="nofollow"><img src="https://13l.net/static/old/img/fhtop.png" alt="返回顶部" title="返回顶部"></a></div> <div class="dibu"> <div class="dibu"> </div> </div> <div class="banquan"> <p>内容全部来源于网络收集,如有侵权,请联系网站删除:QQ:24596024</p> </div> </div> </div> </div> <script> var _hmt = _hmt || []; (function() { var hm = document.createElement("script"); hm.src = "https://hm.baidu.com/hm.js?de17be6dbd20544dd6483cc235b540f9"; var s = document.getElementsByTagName("script")[0]; s.parentNode.insertBefore(hm, s); })(); </script> </body> </html>