Efficiently Streaming Large HTTP Responses With HttpClient

Downloading large files with HttpClient and seeing it use a lot of memory? This post is probably for you. Let's see how to efficiently stream large HTTP responses with HttpClient.

11 May 2014

2 minutes read

.NET

ASP.NET Web API

HTTP

I often see scenarios where people need to download large files (images, PDF files, etc.) in their .NET projects. What I mean by large files here is probably not what you think. A 500 KB file is large enough to count, because you will hit a memory limit once you try to download lots of such files concurrently the wrong way, as below:

static async Task HttpGetForLargeFileInWrongWay()
{
    using (HttpClient client = new HttpClient())
    {
        const string url = "https://github.com/tugberkugurlu/ASPNETWebAPISamples/archive/master.zip";
        using (HttpResponseMessage response = await client.GetAsync(url))
        using (Stream streamToReadFrom = await response.Content.ReadAsStreamAsync())
        {
            string fileToWriteTo = Path.GetTempFileName();
            using (Stream streamToWriteTo = File.Open(fileToWriteTo, FileMode.Create))
            {
                await streamToReadFrom.CopyToAsync(streamToWriteTo);
            }

            response.Content = null;
        }
    }
}

By calling the GetAsync method this way, the whole response body is buffered into memory before the call completes. You can observe this simply by opening Task Manager and watching the memory usage of the process.

We call ReadAsStreamAsync on HttpContent only after the GetAsync method has completed. At that point the content is already buffered, so this just gives us a MemoryStream over it, and copying that stream to a file saves no memory.
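To see the buffering without a network connection, here is a minimal sketch that swaps in a stub handler. FakeLargeResponseHandler, BufferingProbe and the localhost URL are hypothetical names made up for this illustration, and checking for MemoryStream relies on how HttpContent behaves on the runtimes I know of, which is an implementation detail rather than a documented guarantee.

```csharp
using System;
using System.IO;
using System.Net;
using System.Net.Http;
using System.Threading;
using System.Threading.Tasks;

// Hypothetical stub: answers every request with a 1 MB body, no network involved.
sealed class FakeLargeResponseHandler : HttpMessageHandler
{
    protected override Task<HttpResponseMessage> SendAsync(
        HttpRequestMessage request, CancellationToken cancellationToken)
    {
        var body = new MemoryStream(new byte[1024 * 1024]);
        var response = new HttpResponseMessage(HttpStatusCode.OK)
        {
            // StreamContent stands in for the raw network stream here.
            Content = new StreamContent(body)
        };
        return Task.FromResult(response);
    }
}

static class BufferingProbe
{
    // Returns true when the stream handed back by HttpContent is a MemoryStream,
    // which is taken here as a sign that the body was buffered (an assumption).
    public static async Task<bool> IsBufferedInMemoryAsync(HttpCompletionOption option)
    {
        using (var client = new HttpClient(new FakeLargeResponseHandler()))
        using (var response = await client.GetAsync("http://localhost/large-file", option))
        using (var stream = await response.Content.ReadAsStreamAsync())
        {
            return stream is MemoryStream;
        }
    }
}
```

Calling IsBufferedInMemoryAsync with HttpCompletionOption.ResponseContentRead (the default that the plain GetAsync overload uses) should report a buffered stream, while HttpCompletionOption.ResponseHeadersRead should not.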

We need a way to avoid loading the response body into memory and instead get the raw network stream, so that we can pass the bytes to another stream without putting much pressure on memory. We can do this by reading only the headers of the response and then getting a handle to the network stream, as below:

static async Task HttpGetForLargeFileInRightWay()
{
    using (HttpClient client = new HttpClient())
    {
        const string url = "https://github.com/tugberkugurlu/ASPNETWebAPISamples/archive/master.zip";
        using (HttpResponseMessage response = await client.GetAsync(url, HttpCompletionOption.ResponseHeadersRead))
        using (Stream streamToReadFrom = await response.Content.ReadAsStreamAsync())
        {
            string fileToWriteTo = Path.GetTempFileName();
            using (Stream streamToWriteTo = File.Open(fileToWriteTo, FileMode.Create))
            {
                await streamToReadFrom.CopyToAsync(streamToWriteTo);
            }
        }
    }
}

Notice that we are calling another overload of the GetAsync method and passing ResponseHeadersRead as the HttpCompletionOption enumeration value. This switch tells the HttpClient not to buffer the response. In other words, it reads only the headers and then returns control to the caller, which means the response body has not been read yet when you get control back. Afterwards, we get the stream and call the CopyToAsync method on it, passing our FileStream. The result is much better in terms of memory usage.
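To make it clearer why copying stream to stream keeps memory flat, here is a rough sketch of the kind of loop CopyToAsync runs for you. StreamCopy, CopyInChunksAsync and the onProgress callback are hypothetical names for this illustration, and 81920 bytes is just the buffer size picked for the sketch. Only one buffer's worth of data is held at a time, no matter how large the response is.

```csharp
using System;
using System.IO;
using System.Threading.Tasks;

static class StreamCopy
{
    // Copies source to destination one fixed-size chunk at a time and
    // returns the total number of bytes copied.
    public static async Task<long> CopyInChunksAsync(
        Stream source,
        Stream destination,
        int bufferSize = 81920,
        Action<long> onProgress = null)
    {
        var buffer = new byte[bufferSize]; // the only large allocation in the loop
        long total = 0;
        int read;

        // ReadAsync returns 0 once the end of the stream is reached.
        while ((read = await source.ReadAsync(buffer, 0, buffer.Length)) > 0)
        {
            await destination.WriteAsync(buffer, 0, read);
            total += read;
            onProgress?.Invoke(total); // optional hook, e.g. for a progress bar
        }

        return total;
    }
}
```

In the download method above, you could call StreamCopy.CopyInChunksAsync(streamToReadFrom, streamToWriteTo) in place of CopyToAsync if you want to report progress. For a plain copy, CopyToAsync remains the simpler choice.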
