PROMPT LINEAGE AND GOVERNANCE IN LLM-ENABLED DATA ENGINEERING: A REFERENCE ARCHITECTURE

Shambhu Adhikari

PDF

Published: 2024-08-22

Keywords:

Large Language Models, Data Engineering, Prompt Governance, Prompt Lineage, LLMOps

Shambhu Adhikari

Sr. Data Engineer - United Airlines, NW, NJ

Abstract

modern data engineering ecosystems to automate data transformation,
quality assurance, metadata generation, and analytical reasoning. While
these models enhance productivity and adaptability, they introduce significant governance challenges due
to their probabilistic behavior and heavy reliance on prompts as executable control artifacts. Unlike
traditional data pipelines, where logic is encoded in version-controlled code, LLM-enabled systems often
embed prompts in orchestration layers without formal lifecycle management, lineage tracking, or policy
enforcement. This absence of prompt governance undermines reproducibility, auditability, and regulatory
compliance in enterprise data platforms. This paper proposes a reference architecture for prompt lineage
and governance in LLM-enabled data engineering environments. Drawing on principles from DataOps,
MLOps, metadata management, and responsible AI, the architecture treats prompts as first-class governed
assets. It enables versioning, lineage tracking, metadata capture, and policy enforcement across prompt
creation, deployment, and execution. The proposed architecture integrates with modern lakehouse
platforms, orchestration engines, and observability tools to provide end-to-end transparency across data,
prompts, models, and outputs. This study contributes a structured and practical framework to support
scalable, compliant, and trustworthy adoption of LLMs in data engineering workflows.

Downloads

Download data is not yet available.

Issue

Vol. 1 No. 1 (2024): International Journal of Business & Computational Science (Volume 01) 2024

Section

Articles

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

PROMPT LINEAGE AND GOVERNANCE IN LLM-ENABLED DATA ENGINEERING: A REFERENCE ARCHITECTURE

Abstract

Downloads

Issue

Section

Most read articles by the same author(s)

Similar Articles

Article Sidebar

Main Article Content

Abstract

Downloads

Article Details

Issue

Section

Most read articles by the same author(s)

Similar Articles