Crafting Test Automation to Avoid Data Entry Hell

To comply with my non-disclosure agreement, I have omitted and obfuscated confidential information in this case study. All information in this case study is my own and does not necessarily reflect the views of Battelle Memorial Institute.

The Gist

I was hired at Battelle Memorial Institute® as a Research Associate and was tasked to analyze and interpret thousands of data sets exploring the viability of a newly acquired, pharmaceutical technology. The analysis required 100% of my time and included numerous repetitive tasks, but the automation of these repetitive tasks was a challenge. The variation in each data set rendered common Excel® functions useless and required a custom solution that could uphold FDA accuracy requirements.

I set out to learn the Excel® VBA programming language to organize, shift, analyze, graph, and calculate the membrane tension of thousands of parts with minimal manual manipulation. The finished program changed the project entirely. It allowed a co-worker and myself to work on higher-level assignments, eliminated two full time roles, accurately analyzed the data over lunch breaks, and saved the project hundreds of thousands of dollars.

I was the sole contributor until I was joined by a co-worker halfway through the project. I wrote the initial code, designed the layout of the Excel® macro templates, and iteratively tested the program with real data sets. Once my co-worker joined, we doubled our iterative efforts to tweak and constantly improve the program until it could run thousands of data sets accurately and without fail. Our solution also met rigorous FDA requirements and became an official tool for the project.

The component that initated the tests.

The Opportunity

I was hired at Battelle Memorial Institute® to assist the mechanical engineering team in analyzing force spectrometer data with the goal of measuring the tension of a small membrane on a crucial component. The major problem was that there were thousands of parts to measure and therefore thousands of datasets to analyze. These thousands of parts varied greatly. They varied in height, material, and tension causing the data from each subset of tests to be considerably different. This led to the data being analyzed by hand when I arrived. It was a full-time job and the team couldn’t keep up with the analysis and their higher-level duties.

The test setup.

There had to be a better way. Luckily, after a full week of number crunching, I knew the process inside and out. There was an opportunity to automate most of the work, but I needed to get the support of my team. I was not the sole user of the solution and there were systems, procedures, and requirements already established that I had to honor. The main constraint was that the analysis took place in Microsoft Excel®. All of the other systems had been built around using Excel® as the main analysis tool. The data was too complicated for common, single-cell equations so I turned towards Excel® Macros as a possible solution. The only problem with that is that I had never used the Excel® Macro functions before and would have to take the time to train myself before programming a solution.

I needed buy-in from the rest of the team with the key stakeholders being my project manager and engineering lead. I completed a small pitch which included a timeline and estimate of the hours needed to complete the project. I sat down with each of the key stakeholders individually and pitched my plan. They signed off on the idea. We settled on using 20% of my time to work on the solution.

The Design

The first step in designing a solution was to conduct a needs analysis. Luckily I was the main user of the product so I had insider knowledge of how to structure the frontend of the Excel® sheets for efficiency, but I also used the other team members as a sounding board for my idea. I’m glad I did because the project manager gave me a glimpse of a new technical report structure that was going to be rolled out in the next few months. It allowed me to design for that particular report structure instead.

Another key finding was that it was an internal tool. No one cared if it looked good. It just needed to work and be used by anyone in the team at any moment. This knowledge combined with my understanding of the calculations and functions needed was enough to get started. I used many different internet resources to begin deciphering Excel® VBA with a focus on only the functions I needed. I was able to create the first prototype within a few weeks.

Screenshot of macro.

The Solution

The prototype came at the perfect time. We were ordered to conduct a higher volume of tests so even with the prototype’s release, our team had to hire another engineer to aid in the manufacturing and testing of the components. My new coworker added great insight throughout the rest of the project and helped build upon what I started.

We iterated our ideas and solutions with the new influx of tests until our Excel® Macro was foolproof. The program could now analyze and derive relevant data from thousands of data sets in a few hours (or as fast as the computer could go) instead of weeks of manual manipulation.

Screenshot of graph code.

Our time was freed up substantially. We were able to work on higher level work saving the project hundreds of thousands of dollars by eliminating two full-time roles and saving the time and resources needed to recruit and hire for the responsibilities we helped undertake.

The program analyzed over 10,000 data sets and met strict FDA accuracy standards. The program even passed a validation protocol to become an official tool for the project.

Award won after completion of project.

The Conclusion

The program achieved three major goals:

  • It eliminated two, full-time analysis roles.

  • It allowed my coworker and I to support the project at a higher level.

  • It saved our client and company hundreds of thousands of dollars over the life of the project.

    Building the program taught me much about the design process as well as the importance of stakeholder buy-in. I also proved to myself that I could learn new skills quickly, output quality work, and effectively communicate with my team during the process.

This project was where I saw the power of design. It is truly amazing how a simple solution can change the entire landscape of a multi-million dollar project. The magic was in truly understanding the problem at hand. The solution seemed to grow naturally once the problem was truly defined.