Retrieving Web Pages (HTTP), Topic 3, Chapter 6

Network Programming, Kansas State University at Salina

First, some comments
• Switch to application protocols
• Client-side focus
• Pre-built modules
• A natural OO thing – a matter of productivity
• Argh!, someone else’s code
• Lots of choices, language-independent principles
• Web-related network programming
• Chapter 6 – retrieving web pages – easy
• Chapter 7 – Parsing HTML – hard
• Chapter 8 – XML and XML-RPC – interesting

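HTTP Basics
• Stateless protocol; in HTTP/1.0 each request uses its own short-lived TCP connection (often loosely called "connectionless")
• Basic GET …

    import socket

    s = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    s.connect(('www.sal.ksu.edu', 80))

    # HTTP header lines end with CRLF; a blank line ends the request.
    request = (b"GET /faculty/tim/index.html HTTP/1.0\r\n"
               b"From: tim@sal.ksu.edu\r\n"
               b"User-Agent: Python\r\n"
               b"\r\n")
    s.sendall(request)    # sendall() keeps sending until the whole request is out

    fp = open("index.html", "wb")    # note: the response headers are saved too
    while True:
        data = s.recv(1024)
        if not data:      # empty read: the server closed the connection
            break
        fp.write(data)
    s.close()
    fp.close()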
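One detail of the raw-socket version is worth seeing in code: everything recv() returns, status line and headers included, lands in index.html. Below is a minimal sketch of separating the header block from the body. It is written for Python 3 (the slides use Python 2), and split_response and the sample bytes are illustrative names and data, not part of the original slides.

```python
def split_response(raw):
    """Split a raw HTTP response into (status_line, headers, body)."""
    # The header block ends at the first blank line (CRLF CRLF).
    head, _, body = raw.partition(b"\r\n\r\n")
    lines = head.decode("iso-8859-1").split("\r\n")
    status_line = lines[0]
    headers = {}
    for line in lines[1:]:
        name, _, value = line.partition(":")
        # Header names are case-insensitive, so store them lowercased.
        headers[name.strip().lower()] = value.strip()
    return status_line, headers, body


# Hypothetical response bytes, standing in for what recv() would collect.
sample = (b"HTTP/1.0 200 OK\r\n"
          b"Content-Type: text/html\r\n"
          b"Content-Length: 13\r\n"
          b"\r\n"
          b"<html></html>")

status, headers, body = split_response(sample)
print(status)
print(headers["content-type"])
print(body)
```

Writing only body to the file would give a clean index.html. This sketch ignores folded headers and repeated header names, which a real client has to handle.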

Now, for the easy way …

    import sys, urllib2    # Python 2; in Python 3 urllib2 became urllib.request

    page = "http://www.sal.ksu.edu/faculty/tim/"
    req = urllib2.Request(page)
    fd = urllib2.urlopen(req)
    while True:
        data = fd.read(1024)
        if not data:       # empty read: end of the page
            break
        sys.stdout.write(data)
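For readers on Python 3, the same chunked read loop can be sketched with urllib.request. The fetch helper and its chunk_size parameter are illustrative names, not from the slides. The data: URL is only there so the example runs without a network connection; an http:// URL would be passed the same way.

```python
import urllib.request


def fetch(url, chunk_size=1024):
    """Read a URL in chunks and return the whole body as bytes."""
    chunks = []
    with urllib.request.urlopen(url) as fd:
        while True:
            data = fd.read(chunk_size)
            if not data:      # empty read: nothing left
                break
            chunks.append(data)
    return b"".join(chunks)


# A data: URL keeps this example offline.
print(fetch("data:text/plain,hello%20web"))
```

In Python 3 the response body arrives as bytes, so it has to be decoded before being treated as text.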

Submitting with GET

    >>> import urllib
    >>> encoding = urllib.urlencode([('activity', 'water ski'),
    ...                              ('lake', 'Milford'), ('code', 52)])
    >>> print encoding
    activity=water+ski&lake=Milford&code=52
    >>> url = "http://www.example.com" + '?' + encoding
    >>> print url
    http://www.example.com?activity=water+ski&lake=Milford&code=52
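In Python 3 the encoding function lives in urllib.parse. The sketch below builds the same query string and then decodes it again to show what a server would see. The variable names are illustrative, and the trailing "/" before the "?" is an addition made here to give the URL an explicit path.

```python
from urllib.parse import urlencode, urlsplit, parse_qs

params = [('activity', 'water ski'), ('lake', 'Milford'), ('code', 52)]
query = urlencode(params)            # spaces become '+', values become strings
url = "http://www.example.com/?" + query

# Round trip: pull the query back out of the URL and decode it.
decoded = parse_qs(urlsplit(url).query)

print(query)
print(decoded)
```

parse_qs returns a list for every key because a field may appear more than once in a query string.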

Submitting with POST

    >>> import urllib, urllib2
    >>> encoding = urllib.urlencode([('activity', 'water ski'),
    ...                              ('lake', 'Milford'), ('code', 52)])
    >>> print encoding
    activity=water+ski&lake=Milford&code=52
    >>> # Passing data as the second argument turns the request into a POST.
    >>> req = urllib2.Request("http://www.example.com", encoding)
    >>> fd = urllib2.urlopen(req)
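A Python 3 sketch of the same POST, assuming urllib.request. The build_post helper is a hypothetical name introduced here. It only constructs the request object so the example runs offline; calling urllib.request.urlopen(req) would actually send it.

```python
import urllib.request
from urllib.parse import urlencode


def build_post(url, fields):
    """Return a Request that will be sent as a form-encoded POST."""
    # In Python 3 the request body must be bytes, not str.
    body = urlencode(fields).encode("ascii")
    return urllib.request.Request(url, data=body)


req = build_post("http://www.example.com",
                 [('activity', 'water ski'), ('lake', 'Milford'), ('code', 52)])
print(req.get_method())   # the presence of data selects POST
print(req.data)
```

The form data travels in the request body rather than in the URL, which is the practical difference from the GET version.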
