Python programming- List extend() and append()

Blogpump中想在中添加一个嵌套的为其子项,但是调用发现总是结果不对,才发现原来的调用有如此的差别。

1. (扩展) 与 (追加) 的差别
>>>li=['a','b']
>>>li.(['c','d'])
>>>li
['a', 'b', 'c', 'd']
>>>li.(['e','f'])
>>>li
['a', 'b', 'c', 'd',['e','f']]

Continue reading Python programming- List extend() and append() - 全文阅读

Python用SGMLParser抓取网页连接的改进

grove_SGMLParser 在网上看见用抓取网页连接的大部分都是用以下代码:
#!/usr/bin/env
# -*- coding: utf-8 -*-
from sgmllib import
import urllib
import
import socket
socket.setdefaulttimeout(210)

class URLLister():

Continue reading Python用SGMLParser抓取网页连接的改进 - 全文阅读

BlogPump: Blog Post Client with Web Crawler(1) – big picture

imageBig Picture is:

1) Module A: Interface with supported Weblog Server to post/retrieve web page, article and others;

2) Module B: Container to support editor or for data;

3) Module C: Interface with to grasp pages you wanted or articles relevant information against popular search engines;

4) Module D: Profile management for source, patterns and destination combination flexibility;

5) Module E: Data persistent module to store/read locally;

Continue reading BlogPump: Blog Post Client with Web Crawler(1) – big picture - 全文阅读

Contact us

Admin: Bryan Wu