feat(idp-extraction-connector): update the idp connector to accept documents #3835

Merged 1 commit into main from 3824-add-document-support-for-idp-connector on Jan 10, 2025

Conversation

@reiballa reiballa commented Jan 8, 2025

Description

This PR introduces support for documents as input for the outbound IDP extraction connector.

How to test locally:

To test this, you need to:

{
  "baseRequest": {
    "authentication": {
      "accessKey": "<<your access key>>",
      "secretKey": "<<your access secret>>"
    },
    "configuration": {
      "region": "us-east-1"
    }
  },
  "input": {
    "extractionEngineType": "AWS_TEXTRACT",
    "document": <<the json that was returned when uploading the document>>,
    "converseData": {
      "modelId": "anthropic.claude-3-5-sonnet-20240620-v1:0",
      "maxTokens": 512,
      "temperature": 0.5,
      "topP": "0.9"
    },
    "taxonomyItems": [
      {
        "name": "sum",
        "prompt": "the total amount that was paid for this invoice"
      },
      {
        "name": "supplier",
        "prompt": "who provided the goods or services"
      }
    ]
  }
}

You can change the taxonomy items to better match the PDF document that you uploaded.
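
For anyone who prefers to generate the test payload rather than edit it by hand, here is a minimal sketch in plain Java (15 or later, no JSON library). It assumes that converseData and taxonomyItems sit inside input, as the indentation of the example above suggests. The class and method names (IdpRequestSketch, buildRequest, taxonomyJson, escape) are hypothetical helpers and not part of the connector. The document reference is inserted verbatim, since its shape is whatever the upload returned.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.stream.Collectors;

// Hypothetical helper for local testing only; not part of the connector code base.
class IdpRequestSketch {

  /** Escapes backslashes and double quotes so a value can sit inside a JSON string. */
  static String escape(String value) {
    return value.replace("\\", "\\\\").replace("\"", "\\\"");
  }

  /** Renders taxonomy items (name -> prompt) as a JSON array of objects. */
  static String taxonomyJson(Map<String, String> items) {
    return items.entrySet().stream()
        .map(e -> "{\"name\": \"" + escape(e.getKey())
            + "\", \"prompt\": \"" + escape(e.getValue()) + "\"}")
        .collect(Collectors.joining(", ", "[", "]"));
  }

  /**
   * Assembles the request body. documentJson is inserted as-is because it is
   * the JSON returned by the document upload (assumed to be valid JSON).
   */
  static String buildRequest(
      String accessKey, String secretKey, String documentJson, Map<String, String> items) {
    return """
        {
          "baseRequest": {
            "authentication": {"accessKey": "%s", "secretKey": "%s"},
            "configuration": {"region": "us-east-1"}
          },
          "input": {
            "extractionEngineType": "AWS_TEXTRACT",
            "document": %s,
            "converseData": {
              "modelId": "anthropic.claude-3-5-sonnet-20240620-v1:0",
              "maxTokens": 512,
              "temperature": 0.5,
              "topP": "0.9"
            },
            "taxonomyItems": %s
          }
        }
        """.formatted(escape(accessKey), escape(secretKey), documentJson, taxonomyJson(items));
  }

  public static void main(String[] args) {
    // LinkedHashMap keeps the items in insertion order.
    Map<String, String> items = new LinkedHashMap<>();
    items.put("sum", "the total amount that was paid for this invoice");
    items.put("supplier", "who provided the goods or services");

    // Placeholder only: replace with the JSON returned when uploading the document.
    String documentJson = "{\"placeholder\": \"document reference\"}";

    System.out.println(buildRequest("my-access-key", "my-secret-key", documentJson, items));
  }
}
```

Swapping the entries in the items map is enough to try other taxonomy prompts against a different document.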

Related issues

closes #3824

Checklist

  • PR has a milestone or the no milestone label.

@reiballa reiballa self-assigned this Jan 8, 2025
@reiballa reiballa marked this pull request as ready for review January 8, 2025 17:29
@reiballa reiballa requested a review from a team as a code owner January 8, 2025 17:29
@sbuettner sbuettner added this to the 8.7.0-alpha4 milestone Jan 9, 2025

@mathias-vandaele mathias-vandaele left a comment

Great work, one minor comment

group = "input",
type = TemplateProperty.PropertyType.Hidden,
description = "Specify the URL where the document is hosted",

We usually prefer to use tooltip over description

@mathias-vandaele mathias-vandaele added this pull request to the merge queue Jan 10, 2025
@github-merge-queue github-merge-queue bot removed this pull request from the merge queue due to failed status checks Jan 10, 2025
@reiballa reiballa added this pull request to the merge queue Jan 10, 2025
Merged via the queue into main with commit 1fdadce Jan 10, 2025
22 of 23 checks passed
@reiballa reiballa deleted the 3824-add-document-support-for-idp-connector branch January 10, 2025 15:50
Development

Successfully merging this pull request may close these issues.

Move IDP connector template to support Document handling functionality
3 participants