• Subscribe

The data scientist conundrum – reducing the burden of complexity (Part 1/3)

Making best use of data scientist skills relies on integrated end-to-end software solution

Becoming a data scientist mandates a significant interdisciplinary journey; besides the required training in mathematics, statistics, computer science (e.g., programming, databases etc.) it also requires a basic understanding of the system modeled. It takes years of formal and on the job training to become an expert in this field. Data scientists have been exceedingly employed in manufacturing lines to identify issues and improve the overall manufacturing efficiency. The first step in this process is to extract data from numerous dissimilar sources and transform it into a more digestible format for modeling purposes. Industry leaders have long contended that data wrangling is exceedingly time consuming and unstructured to serve many of the quality needs of customers throughout the manufacturing industry[1],[2]

To relieve the data wrangling burden on the data scientist, we have introduced the SmartFactory Rx Digital Platform that addresses this challenge by providing an easy to use, highly configurable minimal code environment with established connectors to many common data sources, including flat files. This way, Extracting, Transferring, Logging (ETL) and contextualizing data becomes a reduced chore and thus the data scientists can spend their time more productively to monitor, get insights and optimize the process they are responsible for.

The initial stage of ETL and contextualization is just the tip of the iceberg of the data scientist’s burden. Additional challenges that data scientists face in many manufacturing line setups today are typically the following: working with multiple software packages for data gathering and analytics (depending on the data source) and addressing the various connectivity and integration challenges. (Figure 1A). These challenges lead to a constant need for data scientists to rely on IT for support and diminishes the data scientist’s ability to rapidly extract supplementary value out of the data.

As a result, getting access to highly integrated software solutions, such as SmartFactory Rx Automation Digital Platform, enables manufacturers to connect data silos and improve the capability for data scientists (Figure 1B) to grasp valuable data insights quickly.
Figure 1A
Figure 1A: Common practice in industry: MES = Management Exc. System, PI = Process Information, BAS = Building Automation system, LIMS = Lab Information Management System
Figure 1B
Figure 1B: SmartFactory Rx enables one stop shopping for data scientists to quickly move up the data pyramid to more advanced analytics. Our SMEs provide best in class experience.

To learn more about our offering to aid Pharma Data Scientists

About the Author

Yoram Barak, PhD, Ilias Iliopoulos, PhD
Yoram Barak, PhD, Ilias Iliopoulos, PhD
The SmartFactory Rx Team develops integrated automation solutions for process manufacturing to harness the power of data, reduce development time and improve productivity to optimize high value manufacturing. It increases throughput, decreases risk, and accelerates time to market for new products. For more details, connect with us on LinkedIn.