ECE Assignment 4: Web Client and Server

Assignment 4: Web Client and Server

1 Environment

To set up the environment for this assignment, you must follow the instructions from “assignment0”.

The instructions can be found at “ECE50863/assignment0/README.pdf”. Once the VM has booted

(by using the command “vagrant up”) and you have established a SSH connection (by using the

command “vagrant ssh”), you should enter the assignment directory by running the command “cd

assignment4”. Now you are ready to run the assignment!

2 Web Page Downloader (30% credit)

There is a very useful program called “wget”. It is a command line tool that you can use to download

a web page like this:

$ wget http://www.gnu.org/software/make/manual/make.html

This will download the make manual page, “make.html”, and save it in the current directory. “wget”

can do much more (for example, downloading a whole web site). See the manual for “wget” for more

information.

For the first part of this assignment, your task is to write a limited version of “wget”, which we

will call “http_client”, that can download a single file. You will do all your implementation in

the file“http_client.c” inside the “http_client” directory. To build and run the code, do the following:

First, go inside the “http_client” directory,

$ cd http_client

Next, compile the code,

$ make

Finally, run the code,

$ ./http_client [host] [port number] [filepath]

For example, ./http_client www.gnu.org 80 /software/make/manual/make.html

So you give the components of the URL separately in the command line — (1) the host, (2) the port

number (which will always be 80 for HTTP), and (3) the file path. The program will download the

given file and save it in the current directory. So in the example above, it should produce “make.html”

in the current directory. It should overwrite an existing file.

Some useful hints:

1. The program should open a TCP socket connection to the host and port number specified in the

command line, and then request the given file using HTTP/1.x protocol.

(See hĴp://www.jmarshall.com/easy/hĴp/ for the details of HTTP/1.x protocol).

2. An HTTP GET request looks like this:

GET /path/file.html HTTP/1.0\r\n

[zero or more headers]\r\n

[blank line]\r\n

Include the following header in your request:

Host: the.host.name.you.are.connecting.to:

Also note that each line in the request ends with a \r\n. This is required by the HTTP protocol.

3. The response from the web server will look something like this:

HTTP/1.0 200 OK\r\n

Date: Fri, 31 Dec 2020 23:59:59 GMT\r\n

Content-Type: text/html\r\n

Content-Length: 1354\r\n

[blank line]\r\n

[file content]

The code ”200” in the first line indicates that the request was successful. If it’s not ”200”, the

program should print the first line to the terminal (stdout) and exit.

You will need to extract the file name from file path (for example, extract make.html from file

path /software/make/manual/make.html), create a new file with extracted file name in the current directory, and write the received file content into that file.

You should use the “Content-Length” value to figure when to stop receiving data from the

web server and close the TCP connection. If the “Content-Length” field is not present in the

response header, print the following error message to the terminal (stdout) and exit:

Error: could not download the requested file (file length unknown)

4. Some useful C library functions for parsing—“strchr()”, “strrchr()”, “strtok()”, “strstr()”

5. You program should be able to download any type of file, not just HTML files. Test your code by

downloading all the different files on the website hĴp://www.gnu.org/software/make/manual/

3 Web Server (70% credit)

For the second part of the assignment, your task is to write a HTTP web server using sockets interface.

The web server will run on TCP port 8080 (typically HTTP web servers run on port 80). This task has

two sub-tasks as described in the following sections. You will implement both the sub-tasks in the

file“http_server.c” inside the “http_server” directory. To build and run the code:

First, go inside the “http_server” directory,

$ cd http_server

Next, compile the code,

$ make

Finally, run the code,

$ ./http_server

3.1 Serving static contents (40% credit)

In this part, you will write a web server that serves static content. The top level directory (called the

“web root”) for your HTML files will be the “Webpage/” directory provided with the assignment.

The web server will only serve contents inside the web root directory. For testing, you can add / remove / modify contents inside web root. We can also do the same while evaluating your submission.

The content served by the web server should be accessible through a web browser running on your

host machine by typing the following URL:

http://localhost:8888/path/to/content/relative/to/web/root

For example, URL http://localhost:8888/index.html should display file “Webpage/index.html”.

Note that the address (localhost) and port number (8888) in the URL is different from the IP address

(VM’s IP address) and port number (8080) of the web server. This is because we are using port forwarding between your host machine and the VM, which does network address translation (NAT) to

direct all the traffic with destination address localhost:8888 to VM’s port number 8080.

Writing a web server is not a trivial task. Here is the list of what is expected and what is not expected

from your web server:

1. The web server will be iterative, i.e., it will serve client requests one request at a time. The

server should close the TCP socket (returned by the “accept()” call) after serving each request.

In practice, most web servers are concurrent, i.e., they could serve multiple client requests in

parallel using multithreading or multiprocessing (e.g., using “fork()”).

2. The web server will only support the GET method. If a browser sends other methods (POST, HEAD,

PUT, for example), the server responds with status code 501. Here is a possible response:

HTTP/1.0 501 Not Implemented\r\n

[blank line]\r\n

501 Not Implemented

Note that server adds a liĴle HTML body for the status code and the message. Without this, the

browser will display a blank page. This should be done for all status codes except 200.

3. Our server will be strictly HTTP/1.0 server. That is, all responses will say “HTTP/1.0”, and all

successful responses will include status code “200 OK”.

The server will accept GET requests that are either HTTP/1.0 or HTTP/1.1 (most browsers these

days send HTTP/1.1 requests). But it will always respond with HTTP/1.0. The server should

reject any other protocol and/or version, responding with 501 status code.

4. The server should also check that the request URI (the part that comes after GET) starts with “/”.

If not, it should respond with “400 Bad Request”.

5. In addition, the server should make sure that the request URI does not contain “/../” and it

does not end with “/..” because allowing “..” in the request URI is a big security risk—the

client will be able to fetch a file outside the web root. If true, respond with “400 Bad Request”.

Note: Most modern browsers automatically check for bad URL requests mentioned in points 4

and 5, and appropriately format the URL before sending it to the server. So, to test points 4 and

5, you can use your “http_client” from Section 2 to send a bad URL request to the server.

6. The server must log each request to terminal (stdout) like this:

128.59.22.109 "GET /index.html HTTP/1.1" 200 OK

It should show the client IP address, the entire request line, and the status code and reason

phrase that the server just sent to the browser (Figure 1).

Figure 1: Sample terminal (stdout) output.

You must log the requests in the exact format as shown in Figure 1. You must not print anything

else to stdout. Violations of these guidelines would result in grade penalty.

7. If the request URI ends with “/”, the server should treat it as if there were “index.html” appended to it. For example, given

http://localhost:8888/

the server will act as if it had been given

http://localhost:8888/index.html

8. If the request URI is a directory, but does not have a “/” at the end, then you should append

“index.html” to it.

Use “stat()” function to determine if a path is a directory or a file.

9. The server sends “404 Not Found” if it is unable to open the requested file.

10. For reading the file, you can use “fread()” or “read()”. You should read the file in chunks and

send it to the client as you read each chunk. The chunk size should be 4096 bytes (that is 4K, the

optimal buffer size for disk I/O for many types of OS/hardware).

Do not read the file one character at a time using “fgetc()” or “getc()”. Do not read the file

one line at a time using “fgets()”—this may not work for image files.

3.2 Serving dynamic contents (30% credit)

In this part, you will add a database service to your web server. Web servers often have to contact

a database to serve certain client requests. In this assignment, clients can request a cat picture by

entering a search string in the textbox displayed on the web page. On geĴing such a request, the web

server will contact the database of cat pictures, and respond with the cat picture requested by the

client. To start the database server, run:

$ ./db_server

Next, start the web server in a different terminal,

$ ./http_server

Some useful hints:

1. If the current URL in your browser is http://localhost:8888/, and you enter the search string

”cute cat” in the textbox and submit, the URL in the browser will now point to,

http://localhost:8888/?key=cute+cat

and the web server will receive the request URI “/?key=cute+cat”. You should extract the

search string ”cute cat” from the URI and send it to the database server.

2. The web server will communicate with the database server over a UDP socket (“SOCK_DGRAM”).

The database server’s IP address and port number are defined in the macros “DBADDR” and

“DBPORT” respectively inside the file “http_server.c”.

3. On receiving a search string, the database server will append ”.jpg” to the search string, and

search for the file with that name inside the directory “cat_database”. If found, the database

server will send the file to the web server in UDP packets of size 4K bytes each. The web server

should relay the data received in those UDP packets to the client over the TCP connection. Once

the entire file has been sent, the database server will send a final UDP packet containing the

string “DONE”. The web server should stop receiving once it receives this final packet. Note

that the web server should not relay the contents of this final packet to the client.

4. If the file is not found in the database, the database server will respond with a UDP packet

containing the string “File Not Found”. On receiving this packet, the web server should respond

to the client with “404 Not Found”.

5. If the database server is not responding, the web server should not wait indefinitely for the

response. Instead, it should timeout after some time interval (e.g., 5 seconds), and respond to

the client with “408 Request Timeout”.

To test this functionality, simply do not start the database server. Then all client requests for cat

pictures should timeout.

One way to implement timeout is to make the UDP socket non-blocking, and use the “select()”

system call to determine when there is some data to be read. The “select()” system call has

an argument of type “struct timeval”, which can be set to the timeout value. If no data is

received by the socket within the timeout interval, “select()” will return 0.

6. Make sure that the logging to the terminal (Figure 1) that you implemented in Section 3.1 also

works for cat picture requests.

Figure 2: Sample terminal (stdout) output for cat picture requests.

4 Submission

You are required to submit two files “http_client.c” and “http_server.c” on Brightspace.

Note: Port forwarding does not work reliably with certain browsers. In our experience, Google

Chrome seems to work most reliably with port forwarding. So, we recommend using Google Chrome

as the browser for this assignment.

联系我们

QQ：99515681
邮箱：99515681@qq.com
工作时间：8:00-21:00
微信：codinghelp

热点文章

辅导 comm2000 creating socia... 2026-01-08
讲解 isen1000 – introductio... 2026-01-08
讲解 cme213 radix sort讲解 c... 2026-01-08
辅导 csc370 database讲解迭代 2026-01-08
讲解 ca2401 a list of colleg... 2026-01-08
讲解 nfe2140 midi scale play... 2026-01-08
讲解 ca2401 the universal li... 2026-01-08
辅导 engg7302 advanced compu... 2026-01-08
辅导 comp331/557 – class te... 2026-01-08
讲解 soft2412 comp9412 exam辅... 2026-01-08
讲解 scenario # 1 honesty讲解... 2026-01-08
讲解 002499 accounting infor... 2026-01-08
讲解 comp9313 2021t3 project... 2026-01-08
讲解 stat1201 analysis of sc... 2026-01-08
辅导 stat5611: statistical m... 2026-01-08
辅导 mth2010-mth2015 - multi... 2026-01-08
辅导 eeet2387 switched mode ... 2026-01-08
讲解 an online payment servi... 2026-01-08
讲解 textfilter辅导 r语言 2026-01-08
讲解 rutgers ece 434 linux o... 2026-01-08

热点标签

msinm014/msing014/msing014b

联系我们 - QQ: 99515681 微信：codinghelp

程序辅导网！