Puppeteer on AWS Lambda – DEV Neighborhood

October 6, 2022

1

Questioning how one can get Puppeteer to work correctly on AWS Lambda?

You’re in the fitting place! On this put up, we’ll cowl the principle challenges you possibly can encounter whereas making an attempt to try this. However first, let’s begin with introducing each Puppeteer and AWS Lambda.

What’s Puppeteer?

Merely put, Puppeteer is a software program for controlling a (headless) browser. It’s a chunk of open-source software program developed and supported by Google’s developer instruments staff. It lets you simulate consumer interplay with a browser via a easy API. That is very useful for doing issues like automated checks or internet scraping.

An image’s price a thousand phrases. How a lot is a gif price? With slightly little bit of code proven within the gif under, I can log in to a Google account. You merely must click on, enter textual content, paginate, and scrape all of the publicly out there knowledge you want.

What’s AWS Lambda?

AWS Lambda is what Amazon calls “Run code with out excited about servers or clusters.” You’ll be able to merely create a perform on Lambda after which execute it. It’s that simple.

Merely put, you are able to do the whole lot on AWS Lambda. Okay, the whole lot is a robust phrase, however nearly. For instance, it’s attainable to scrape 1000’s of public internet pages each evening with AWS Lambda capabilities. Additionally, it manages to insert knowledge into databases.

Getting began with AWS Lambda is easy and cheap. You solely must pay for what you employ, and so they even have a beneficiant free trial.

Drawback #1 – Puppeteer is just too massive to push to Lambda

AWS Lambda has a 50 MB restrict on the zip file you push on to it. As a result of the truth that it installs Chromium, the Puppeteer bundle is considerably bigger than that. Nonetheless, this 50 MB restrict doesn’t apply once you load the perform from S3! See the documentation right here.

AWS Lambda quotas may be tight for Puppeteer

The 250 MB unzipped may be bypassed by importing instantly from an S3 bucket. So we create a bucket in S3, use a node script to add to S3, after which replace our Lambda code from that bucket. The script seems to be one thing like this:

“zip”: “npm run construct && 7z a -r perform.zip ./dist/* node_modules/”,

“sendToLambda”: “npm run zip && aws s3 cp perform.zip s3://chrome-aws && rm perform.zip && aws lambda update-function-code –function-name puppeteer-examples –s3-bucket chrome-aws –s3-key perform.zip”

Puppeteer on AWS Lambda doesn’t work

By default, Linux (together with AWS Lambda) doesn’t embrace the mandatory libraries required to permit Puppeteer to perform.

Luckily, there already exists a bundle of Chromium constructed for AWS Lambda. You’ll find it right here. You’ll need to put in it and puppeteer-core in your perform that you’re sending to Lambda.

The common Puppeteer bundle won’t be wanted and, actually, counts in opposition to your 250 MB restrict.

npm i --save chrome-aws-lambda puppeteer-core

After which, if you find yourself setting it as much as launch a browser from Puppeteer, it should appear to be this:

const browser = await chromium.puppeteer

.launch({

args: chromium.args,

defaultViewport: chromium.defaultViewport,

executablePath: await chromium.executablePath,

headless: chromium.headless

});

Remaining word

Puppeteer requires extra reminiscence than an everyday script, so control your max reminiscence utilization. When utilizing Puppeteer, we advocate a minimum of 512 MB in your AWS Lambda perform. Additionally, don’t overlook to run await browser.shut() on the finish of your script. In any other case, chances are you’ll find yourself along with your perform working till timeout for no purpose as a result of the browser continues to be alive and ready for instructions.

Previous articleFind out how to Concurrently Auto-Transcribe and File Zoom Calls With Descript

Puppeteer on AWS Lambda – DEV Neighborhood

What’s Puppeteer?

What’s AWS Lambda?

Drawback #1 – Puppeteer is just too massive to push to Lambda

Puppeteer on AWS Lambda doesn’t work

Remaining word

css – Greatest observe for customized main coloration of pages

Distinction Safety introduces new safety testing instruments for JavaScript frameworks

A lot Quicker than std::string, fmt::format, std::to_chars, std::time and extra?

LEAVE A REPLY Cancel reply

Most Popular

Find out how to Concurrently Auto-Transcribe and File Zoom Calls With Descript

css – Greatest observe for customized main coloration of pages

Keep away from This Linux Replace That Can Reportedly Harm Intel Laptop computer Shows

3D Printing Rising As Sport-Changer In Many Industries

Recent Comments

ABOUT US

POPULAR POSTS

Find out how to Concurrently Auto-Transcribe and File Zoom Calls With Descript

css – Greatest observe for customized main coloration of pages

Keep away from This Linux Replace That Can Reportedly Harm Intel Laptop computer Shows

POPULAR CATEGORY