Linux网络-HttpServer的实现

文章目录

前言
一、请求报文的解析
- URL的解析
二、响应报文的发送
三、尝试发送一个HTML网页
- 404网页
- [Location 重定向](#Location 重定向)
四、浏览器的多次请求行为
总结

前言

之前我们简单理解了一下Http协议，本章我们将在LInux下使用Socket编程自主完成一个HttpServer。可以做到接收Http报文数据，加以解析再向远端发送Http报文数据。

之前写过很多遍的网络套接字编程代码就不再重复写了，这里直接写关于HttpServer的代码

一、请求报文的解析

上一章我们讲了，请求报文主要分为请求行，请求报头，请求正文。

所以，我们就需要来解析我们收到的报文数据。

cpp 复制代码

class HttpRequest
{
public:
    HttpRequest()
    {
    }

    bool Deserialize(std::string &request)
    {
        size_t pos = request.find(sep);
        if (pos == std::string::npos)
        {
            // 不完整报文
            lg(Warning, "Recv Incomplete Request...");
            return false;
        }
        _request_line = request.substr(0, pos);
        request.erase(0, pos + sep.size());

        std::string tmp;
        while (true)
        {
            pos = request.find(sep);
            if (pos == std::string::npos)
            {
                break;
            }
            tmp = request.substr(0, pos);
            if (tmp.empty())
            {
                // 说明已经截到空行
                break;
            }
            _request_header.push_back(tmp);
            request.erase(0, pos + sep.size());
        }
        request.erase(0, sep.size());
        _content = request;
        return true;
    }

    bool Parse()
    {
        std::string tmp = _request_line;
        int pos = tmp.find(blank);
        if (pos == std::string::npos)
        {
            // 解析的请求行存在问题
            return false;
        }
        _function = tmp.substr(0, pos);
        tmp.erase(0, pos + blank.size());

        pos = tmp.find(blank);
        if (pos == std::string::npos)
        {
            // 解析的请求行存在问题
            return false;
        }
        std::string url_tmp = tmp.substr(0, pos);
        if (url_tmp == "/")
        {
            _url = homepage;
        }
        else
        {
            _url = fileroot;
            _url += url_tmp;
        }

        tmp.erase(0, pos + blank.size());


        _http_version = tmp;
        return true;
    }


public:
    std::string _request_line;
    std::vector<std::string> _request_header;
    std::string _content;
    std::string _function;
    std::string _url;
    std::string _http_version;
	
	bool _isFound = true; //判断是否存在访问资源
};

上面通过的request成员函数，可以讲一份完整的报文全部解析下来。

URL的解析

上章我们讲过，URL的作用是为了找到该服务器上唯一的资源，那么我们就需要对URL再进行解释，才能正确找到想要请求的文件。

一般来讲我们的，我们在网址上的URL其实是在服务器的工作目录中的查找的，当然，如果你想访问其他目录的文件，只需要自己稍作解析即可，我们仅谈论大多数情况。

所以，为了可以更好的控制访问资源，我们就可以在服务器工作目录创建一个web根目录，将所有需要用到的其他资源分类放进去。

就比如说这里，我们创建了一个名为webroot的根目录。

再在服务器内部代码定义根目录路径，后续只需要直接在后面添加我们解析后的URL字符串就可以实现精准访问唯一一份资源了。

cpp 复制代码

std::string ReadFileData(const std::string &filepath)
{
    std::ifstream in(filepath, std::ios::binary);
    if (!in.is_open())
    {
        // 文件打开失败,返回一个空串
        lg(Warning, "File Open Failed...");
        return "";
    }
    // 将文件流指针移动到文件结尾
    in.seekg(0, std::ios_base::end);
    auto len = in.tellg();
    // 重新将文件流指针移动到文件开头
    in.seekg(0, std::ios_base::beg);

    std::string content;
    content.resize(len);

    in.read((char *)content.c_str(), content.size());

    return content;
}

因为我们有时候会读取一个二进制文件，例如png,jpg格式的图片，所以我们这里采用二进制读取的方式打开文件。

最后返回的content就是文件的全部数据。

二、响应报文的发送

在上章我们也讲过相应报头是由状态行，响应报头，正文组成。

所以，我们要遵循http协议，就必须要遵守http协议的响应报头发送格式来发送数据。

cpp 复制代码

    std::string Encode(const std::string &content, const HttpRequest &hr)
    {
        std::string mes;
        if (hr._isFound)
        {
            mes += "HTTP/1.0 200 OK\r\n";
        }
        else
        {
            mes += "HTTP/1.0 404 NotFound\r\n";
        }
        mes += "Content-Lenth: ";
        mes += std::to_string(hr._content.size());
        mes += sep;

        mes += "Content-Type: ";
        mes += SuffixtoType(hr._suffix);
        mes += sep;

        mes += "Set-Cookie: ";
        mes += hr._content;
        mes += sep;

        mes += sep; // 空行

        mes += content;
        return mes;
    }

该Encode函数就帮我们格式化了一个还算完整的响应报文。