Convert Word Document to PDF in AWS Lambda

13 Sep 2024 | 5 minutes to read

Syncfusion DocIO is a .NET Core Word library used to create, read, edit and convert Word documents programmatically without Microsoft Word or interop dependencies. Using this library, you can convert a Word document to PDF in AWS Lambda.

Steps to convert Word document to PDF in AWS Lambda

Step 1: Create a new AWS Lambda project as follows.
[Screenshot: AWS Lambda project]

Step 2: Select Blueprint as Empty Function and click Finish.
[Screenshot: Select Blueprint as Empty Function]

Step 3: Install the following NuGet packages in your application from NuGet.org.

Syncfusion.DocIORenderer.Net.Core
SkiaSharp.NativeAssets.Linux
HarfBuzzSharp.NativeAssets.Linux

NOTE

Starting with v16.2.0.x, if you reference Syncfusion assemblies from the trial setup or from the NuGet feed, you must also add a reference to the “Syncfusion.Licensing” assembly and include a license key in your project. Refer to the Syncfusion licensing documentation to learn how to register a license key in your application.
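As a rough illustration of the note above, the license key can be registered once in the constructor of the Lambda function class, so it is in place before the first conversion. This is a minimal sketch, not the documented Syncfusion setup: the environment variable name SYNCFUSION_LICENSE_KEY is a hypothetical choice made for this example, and it assumes the Syncfusion.Licensing package is referenced.

```csharp
using System;
using Syncfusion.Licensing;

public class Function
{
    // Lambda creates the function class once per execution environment,
    // so the key is registered before the first invocation runs.
    public Function()
    {
        // SYNCFUSION_LICENSE_KEY is a hypothetical variable name used only in this sketch.
        var key = Environment.GetEnvironmentVariable("SYNCFUSION_LICENSE_KEY");
        if (!string.IsNullOrEmpty(key))
        {
            SyncfusionLicenseProvider.RegisterLicense(key);
        }
    }
}
```

Reading the key from configuration rather than hard-coding it keeps it out of source control; the FunctionHandler method from Step 7 would live in this same class.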

Step 4: Create a folder (named Data in this example), copy the required data files into it, and include the files in the project.
[Screenshot: Create a folder]

Step 5: Set the Copy to Output Directory property to Copy if newer for all the data files.
[Screenshot: Property change for data files]
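The same setting can also be made directly in the project file. The fragment below is a sketch that assumes an SDK-style .csproj and that the data files live in a folder named Data, as the path used in the conversion code suggests; PreserveNewest is the project-file value behind Copy if newer.

```xml
<ItemGroup>
  <!-- Copy every file under the Data folder to the output directory when it is newer -->
  <None Update="Data\**\*">
    <CopyToOutputDirectory>PreserveNewest</CopyToOutputDirectory>
  </None>
</ItemGroup>
```

Using a wildcard means files added to the folder later are picked up without changing their properties one by one.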

Step 6: Include the following namespaces in the Function.cs file.

using System;
using System.IO;
using Syncfusion.DocIO;
using Syncfusion.DocIO.DLS;
using Syncfusion.DocIORenderer;
using Syncfusion.Pdf;

Step 7: Add the following code snippet in Function.cs to convert a Word document to PDF.

/// <summary>
/// Converts the bundled Word document to PDF and returns it as a Base64 string.
/// </summary>
/// <param name="input">Request payload; not used by the conversion.</param>
/// <param name="context">Lambda execution context.</param>
/// <returns>The PDF document encoded as a Base64 string.</returns>
public string FunctionHandler(string input, ILambdaContext context)
{
    string filePath = Path.GetFullPath(@"Data/Adventure.docx");

    //Load the file from the disk; the using block closes the file stream afterwards
    using (FileStream fileStream = new FileStream(filePath, FileMode.Open, FileAccess.Read))
    {
        //Open the Word document and convert it to PDF
        WordDocument document = new WordDocument(fileStream, FormatType.Docx);
        DocIORenderer render = new DocIORenderer();
        PdfDocument pdf = render.ConvertToPDF(document);

        //Save the PDF document into a memory stream
        MemoryStream stream = new MemoryStream();
        pdf.Save(stream);

        //Releases all resources used by the Word document and DocIO Renderer objects
        document.Close();
        render.Dispose();

        //Closes the PDF document
        pdf.Close();

        //Return the PDF bytes as a Base64 string
        return Convert.ToBase64String(stream.ToArray());
    }
}

Step 8: Right-click the project and select Publish to AWS Lambda.
[Screenshot: Publish to AWS Lambda]

Step 9: Create a new AWS profile in the Upload Lambda Function window. After creating the profile, enter a name for the Lambda function to publish. Then, click Next.
[Screenshot: Upload Lambda Function]

Step 10: In the Advanced Function Details window, select a Role Name based on an AWS managed policy. After selecting the role, click the Upload button to deploy your application.
[Screenshot: Advanced Function Details]

Step 11: After deploying the application, you can see the published Lambda function in the AWS console.
[Screenshot: Published Lambda function in the AWS console]

Step 12: In the General configuration of the AWS Lambda function, edit Memory and Timeout and set them to their maximum values.
[Screenshot: AWS Lambda Function]
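If you prefer to keep these settings with the project, the aws-lambda-tools-defaults.json file used by the AWS .NET Lambda tooling accepts memory and timeout values that are applied when the function is published. The fragment below is a sketch with illustrative numbers: 900 seconds is the Lambda timeout ceiling, while the memory value here is only an example and should be raised to whatever your account allows.

```json
{
  "function-memory-size": 2048,
  "function-timeout": 900
}
```

These keys sit alongside the other entries already present in that file; values edited later in the console apply until the next publish overwrites them.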

Steps to post the request to AWS Lambda

Step 1: Create a new console project.
[Screenshot: Create a console project]

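Step 2: Install the following NuGet packages in your application from NuGet.org. The namespaces included in Step 3 come from the AWSSDK.Core, AWSSDK.Lambda, and Newtonsoft.Json packages.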

Step 3: Include the following namespaces in the Program.cs file.

using System;
using System.IO;
using Amazon;
using Amazon.Lambda;
using Amazon.Lambda.Model;
using Newtonsoft.Json;

Step 4: Add the following code snippet in Program.cs to invoke the published AWS Lambda function using the function name and access keys.

//Create a new AmazonLambdaClient (replace the placeholders with your own access keys and region)
AmazonLambdaClient client = new AmazonLambdaClient("awsAccessKeyID", "awsSecretAccessKey", RegionEndpoint.USEast2);

//Create a new InvokeRequest with the published function name
InvokeRequest invoke = new InvokeRequest
{
    FunctionName = "MyNewFunction",
    InvocationType = InvocationType.RequestResponse,
    Payload = "\"Test\""
};

//Get the InvokeResponse from the client
//Note: Invoke is available on .NET Framework; on .NET Core and later, use client.InvokeAsync(invoke) instead
InvokeResponse response = client.Invoke(invoke);

//Read the response stream; the payload is a JSON string that holds the Base64 text
var stream = new StreamReader(response.Payload);
JsonReader reader = new JsonTextReader(stream);
var serializer = new JsonSerializer();
var responseText = serializer.Deserialize(reader);

//Convert the Base64 string into a PDF document
byte[] bytes = Convert.FromBase64String(responseText.ToString());
FileStream fileStream = new FileStream("Sample.pdf", FileMode.Create);
BinaryWriter writer = new BinaryWriter(fileStream);
writer.Write(bytes, 0, bytes.Length);
writer.Close();

//Open the PDF with the default viewer
//Note: on .NET Core and later, pass a ProcessStartInfo with UseShellExecute = true
System.Diagnostics.Process.Start("Sample.pdf");

By executing the program, you will get the PDF document as follows.

[Screenshot: Word to PDF in AWS Lambda]
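The response handling in Step 4 relies on the payload being a JSON string that wraps the Base64 text returned by the function. As a sketch of that decoding step on its own, the helper below uses System.Text.Json from the base class library instead of Newtonsoft.Json; the class and method names (LambdaPayload, DecodePdf) are hypothetical and exist only for this example, and the sample bytes stand in for a real PDF.

```csharp
using System;
using System.IO;
using System.Text;
using System.Text.Json;

public static class LambdaPayload
{
    // Parses a payload of the form "<Base64 text>" (a JSON string)
    // and returns the decoded bytes.
    public static byte[] DecodePdf(Stream payload)
    {
        using JsonDocument json = JsonDocument.Parse(payload);
        return json.RootElement.GetBytesFromBase64();
    }

    public static void Main()
    {
        // Simulate the function's response: Base64 text wrapped in JSON quotes.
        byte[] original = Encoding.ASCII.GetBytes("%PDF-1.7");
        string payloadText = "\"" + Convert.ToBase64String(original) + "\"";

        using var payload = new MemoryStream(Encoding.UTF8.GetBytes(payloadText));
        byte[] decoded = DecodePdf(payload);

        Console.WriteLine(Encoding.ASCII.GetString(decoded)); // prints "%PDF-1.7"
    }
}
```

In the console application, the same helper could be called as LambdaPayload.DecodePdf(response.Payload), with the result written to disk through File.WriteAllBytes("Sample.pdf", bytes).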

From GitHub, you can download the console application and AWS Lambda project.

Explore the rich set of Syncfusion Word library (DocIO) features.

An online sample is also available for converting a Word document to PDF in ASP.NET Core.