Recommended approach for processing a large amount of data from an Excel sheet

Hi there,

I have a design question about a project I am working on. We have a requirement to fetch an Excel sheet (~12 MB, ~55,000 rows) into Appian daily from an external source and process it. Which of the following would be the better or recommended approach?

a) Fetch and parse the Excel file in Appian (using the Excel Tools plugin). [My understanding is that fetching and parsing 55k rows of data daily in Appian may impact Appian's performance.]

b) Fetch and parse the Excel file with another tool, such as Informatica, and implement only the workflow part in Appian.

Thanks!

    Certified Lead Developer
    Hello Rohit,

    Processing a large amount of data will impact the performance of the Appian environment. One approach is to use an Excel Tools plugin smart service to read the Excel file and import it into the database, and then make a separate call to a stored procedure that executes the business logic and refines and further processes the data.
    With this approach the data is not held in a CDT or in the process model, so it does not consume additional AMU.
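
To illustrate the "stage first, then process in the database" idea, here is a minimal sketch in Python. It is not Appian or plugin code: SQLite stands in for the business database, one set-based SQL statement stands in for the stored procedure, and the table and column names (`staging_rows`, `orders`, `order_id`, `amount`) are hypothetical. The rows are assumed to come from any reader that yields one tuple per Excel row.

```python
import sqlite3
from itertools import islice


def load_and_process(conn, rows, batch_size=5000):
    """Stage raw rows in batches, then refine them with one set-based SQL statement."""
    cur = conn.cursor()
    # Hypothetical tables: raw text lands in staging_rows, cleaned data in orders.
    cur.execute("CREATE TABLE IF NOT EXISTS staging_rows (order_id TEXT, amount TEXT)")
    cur.execute("CREATE TABLE IF NOT EXISTS orders (order_id TEXT PRIMARY KEY, amount REAL)")
    cur.execute("DELETE FROM staging_rows")  # each daily run starts from an empty staging table

    # Insert in fixed-size batches so the whole sheet is never held in memory at once.
    it = iter(rows)
    while True:
        batch = list(islice(it, batch_size))
        if not batch:
            break
        cur.executemany("INSERT INTO staging_rows VALUES (?, ?)", batch)

    # Stand-in for the stored procedure: the database does the refinement,
    # so no row data has to pass through process variables.
    cur.execute(
        """
        INSERT OR REPLACE INTO orders (order_id, amount)
        SELECT order_id, CAST(amount AS REAL)
        FROM staging_rows
        WHERE order_id IS NOT NULL AND order_id <> ''
        """
    )
    conn.commit()
    return cur.execute("SELECT COUNT(*) FROM orders").fetchone()[0]
```

The point of the design is that the workflow only triggers the load and the processing step; the 55k rows themselves stay in the database.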

    Another approach is to handle the complete operation outside the Appian environment. A Java batch program, web service, or file upload and processing program outside Appian can process the data directly. The processing statistics can be stored in a table and later fetched in Appian so that business users can check the processing results.
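
As a rough sketch of the statistics part of this approach, the external job could write one summary row per run to a table that Appian later queries. The example below is only an illustration under assumptions: it uses Python and SQLite rather than a Java batch program, and the table name `import_run_stats`, the two-column row shape, and the validation rule are all hypothetical.

```python
import sqlite3
from datetime import datetime, timezone


def run_batch(conn, rows):
    """Validate rows outside Appian and record one summary row per run."""
    # Hypothetical statistics table that Appian could read for reporting.
    conn.execute(
        "CREATE TABLE IF NOT EXISTS import_run_stats ("
        "run_at TEXT, total_rows INTEGER, accepted INTEGER, rejected INTEGER)"
    )
    total = accepted = rejected = 0
    for order_id, amount in rows:
        total += 1
        try:
            float(amount)             # assumed rule: amount must be numeric
            valid = bool(order_id)    # assumed rule: order_id must be non-empty
        except (TypeError, ValueError):
            valid = False
        if valid:
            accepted += 1  # a real job would write the row to its target table here
        else:
            rejected += 1
    conn.execute(
        "INSERT INTO import_run_stats VALUES (?, ?, ?, ?)",
        (datetime.now(timezone.utc).isoformat(), total, accepted, rejected),
    )
    conn.commit()
    return total, accepted, rejected
```

With a table like this, Appian only needs to read a handful of summary rows per day instead of the full data set.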

    I would suggest keeping the implementation outside Appian, which will also be a scalable solution for future enhancements.

    Thank You
    Bibhuti