Therapeutic Area Level of Clinical Study Analysis
Requirement
Introduction: This analysis aims to provide valuable insights by aggregating data related to various therapeutic areas, allowing stakeholders to understand the progress and effectiveness of studies across different therapeutic domains. The analysis will assist in identifying high-performing areas, areas requiring more resources, and study status distributions.
Requirements: Create a Pyspark logic by Read the table “study_therapeutic_analysis“. Group the data by “therapeutic_area" and “study_title” with aggregations of total_studies, Count of all studies per therapeutic area. Mark completed_studieswnen study_conduct_status is Completed. Mark ongoing_studies when study_conduct_status is as Ongoing. Average enrolled study subjects per therapeutic area,
Final Output: Display the results with the column of “therapeutic_area“, “total_studies“, “total_studies“, “ongoing_studies“, “avg_enrolled_subjects"
Unity Catalog: “purgo_playground.study_therapeutic_analysis“