Ifraz Photo

About Me

I am a Doctorate student at Purdue University with five years of experience in various tech stacks such as: Hadoop Ecosystem, Python, Azure, Microsoft Power BI, R, and SQL Server. I have current knowledge of data analysis, data clean up, interpretation and visualization using modern programming languages and visualization tools.

Contact Details

Ifraz Ahmed
Houston, TX
IfrazAhmedProfessional@gmail.com

Experience

Clinic Scribe

Founder June 2024 - Present

• Founder of Clinic Scribe, a AI powered note taking application to assist healthcare workers in patient documentation and generating SOAP notes.
• Designed and developed the MVP for Clinic Whisper, an AI-powered clinical documentation tool that transcribes patient-provider conversations and generates SOAP notes using OpenAI Whisper and GPT-4.
• Built a live audio transcription pipeline using Streamlit, webrtc, Whisper, and Python, enabling real-time, browser-based recording and transcription.
• Integrated Open Source AI model to automatically convert clinical transcripts into structured SOAP notes, supporting multiple medical specialties.
• Implemented multilingual transcription support across 50+ languages, improving accessibility for diverse clinical settings.
• Deployed the app on Render with version-controlled codebase using GitHub and virtual environments.
• Deployed the app on Render with version-controlled codebase using GitHub and virtual environments.
• Collaborated on prompt engineering and UI/UX design to improve provider experience, onboarding flow, and real-time AI cue suggestions (Cue Coach™ concept).
• Established early-stage security practices aligned with HIPAA and SOC 2 readiness, using encrypted file handling and temporary data storage.
• Worked closely with the founding team to test early adoption through landing pages, waitlists, and user interviews, contributing to product-market validation.

Suvida Healthcare

Data Engineer July 2023 - May 2024

• Developed and maintained over twenty pipelines in Azure Synapse for ELT.
• Ingested data from SFTP servers, Salesforce API, and Snowflake API into Azure Data Lake Storage Gen 2 utilizing Azure pipelines.
• Loaded data from ADLS2 into staging database within SQL Server by leveraging data flows.
• Developed PySpark notebooks in ADF for querying REST APIs and writing their JSON payload to SQL Server.
• Utilized Azure DevOps for tracking user story progress in sprints and Azure Repos for version control.
• Operated Data Build Tool for implementing version control, testing, and freshness checks for SQL tables.
• Performed QA during enterprise-wide Snowflake migration to ensure data integrity, consistency, and reliability between deprecated and new systems.
• Developed complex SQL queries to join multiple tables leveraging natural keys such as datetime and email, ensuring accurate data integration and retrieval across diverse datasets.

Capgemini

Insights and Data Consultant January 2022 - March 2023

• Provided additional instruction to individuals falling behind by providing support for homework and projects
• Key role in integrating inventory management, transactions, order history, and service scheduling with GK POS, CAR, and SAP Commerce APIs to create seamless experience for employees and customers.
• Successfully facilitated client communication through end-of-day email updates, FDD tracker, and FSD progress charts.
• Refined data strategy document to identify existing SAP integrations, new SAP integrations, and integrations requiring enhancements between inner-circle and outer-circle implementations.
• Developed meeting minutes during the requirements gathering phase for understanding client needs between systems and business processes.

General Assembly

Associate Instructor (Contractor) October 2022 - October 2023

• Delivered supplemental instruction to Fortune 500 employees for mastery of Python, PostgreSQL, Microsoft Power BI, and Excel.
• Assisted students in gaining a fundamental and technical understanding of data science and analytics.
• Provided additional instruction to individuals falling behind by providing support for homework and projects

Internal Revenue Service

Data Science Fellow June 2021 - August 2021

• Improved existing processes and methods for aggregating and using operational data to conduct research and support high priority IRS operations.
• Engineered ETL workflow by creating a scp connection between IRS Server and EdgeNode within the Hadoop Cluster.
• Designed documentation using white paper methodology for easy replicability of SparkR environment.
• Improved processing times and increased stability of R code through the implementation of SparkR.
• Performed transformations and actions on SparkR DataFrames for statistical analysis, validation, program stability, and performance improvements.

University of Houston

Instructional Assistant December 2020 - May 2021

• Provided supplemental instruction for MBA and MS students in Quantitative Analysis (QA) and Financial Accounting (FA).
• Assisted students in understanding and internalizing fundamental concepts in QA and FA through one-on-one and group tutoring sessions.
• Collaborated with professors to highlight key topics discussed during class for supplemental instruction.

Harris County Public Health

Data Warehouse Intern May 2019 - August 2019

• Developed Microsoft Power BI Reports visualizing at risk populations within Harris County based on household income and school enrollment with geolocation, year and other relevant filters.
• Created scalar functions and stored procedures in SQL Server to validate the data contained within Power BI Reports.
• Developed a Python script to extract veterinary public health data into Python, applied transformations for data clean up, and loaded back into the correct directory” for parallel verb tense consistency to be used within the data warehouse.
• Created a program that utilized four fuzzy logic algorithms to give a confidence rating on how close an input matches a record in a table.

Digi-Safari

Hadoop and Spark Developer Boot Camp March 2018 - July 2018

• Transformed data in Spark leveraging SQL Context and Hive Context APIs to discover most profitable variables.
• Performed import and export operations on customer data between RDBMS and HDFS operating Sqoop.
• Ingested data into Spark-Shell using Avro, Parquet, JSON, CSV, TSV, and text file formats; transformed into DataFrame; performed SparkSQL queries and saved back on to HDFS applying Gzip compression.
• Designed external tables in Hive, ingested JSON formatted files from HDFS, and ran queries using Hive Query Language.
• Leveraged RDD APIs to calculate aggregations of orders inside spark-shell environment.
• Installed, configured, and tested Hadoop Ecosystem inside a virtual machine environment in Virtual Box.

Education

Purdue University

Doctorate in Technology December 2027

• Dissertation on comparative analysis of AI-generated clinical documentation. Evaluating the cost-efficiency and reliability of Clinic Scribe against enterprise ambient listening solutions.
• Wrote paper exploring how cloud platforms have democratized technology for small to mid-size companies by offering ready-to-deploy solutions for a range of business needs.
• Produced a paper detailing three different competency models and how they are leveraged in the food and beverage service industry, non-profits, and government agencies.
• Developed a paper to explain similarities and differences between the three most recent generational cohorts, Baby Boomers, Generation X, Generation Y (Millennials), and Generation Z.
• Authored a paper examining cybersecurity in healthcare, including government regulation, EMR security practices, universal health data sharing standards, and the implications and consequences of data breaches.

University of Houston

Master of Science in Business Analytics December 2021

• Developed R applications for data transformation, management and visualization.
• Created entity relationship diagrams, and data models for organization within a DBMS.
• Applied statistical modeling to data sets to uncover trends useful to business.
• Reported on cases and deaths involving COVID-19 using an S3 dataset to uncover why certain countries managed the pandemic better than others with similar population density and demographics.

Georgia State University

Bachelor of Business in Computer Information Systems May 2018

• Plugged in source code from GitHub into IBM Watson to predict an individual’s personality traits from sample text entered by the user.
• Team leader responsible for upgrading the existing systems in Golf Course Country Club that is experiencing increased stress on its bookings, maintenance and human resource departments due to growth in membership.
• Created mathematical and graphical models using Microsoft Excel to interpret data into meaningful information.
• Developed skills that are overlooked in business such as active listening, how to have unpleasant conversations, and working across cultural and ethnic differences.

Technical

Tech Stacks

  • Apache Spark (Spark-Shell)
  • Python
  • R
  • HDFS (Unix)
  • Apache SQOOP
  • Hive
  • Azure Synapse Analytics (Unix)
  • SQL
  • Azure Data Lake
  • Azure DevOps

Motivational Quotes

  • Five Percent of the Challenge Is the Strategy. Ninety-five Percent Is the Execution.

    Carlos Ghosn
  • “Education is the most powerful weapon which you can use to change the world.”

    Nelson Mandela

Get In Touch.

If you are interested in learning more about what I can offer you and your organization please do not hesitate to shoot me an email at IfrazAhmedProfessional@gmail.com.

Error boy
Your message was sent, thank you!