我正在关注管道教程 https://kedro.readthedocs.io/en/latest/03_tutorial/04_create_pipelines.html,创建所有需要的文件,启动 kedrokedro run --node=preprocessing_data
但遇到了这样的错误消息:
ValueError: Pipeline does not contain nodes named ['preprocessing_data'].
如果我运行 kedro 时没有node
参数,我收到
kedro.context.context.KedroContextError: Pipeline contains no nodes
文件内容:
src/project/pipelines/data_engineering/nodes.py
def preprocess_data(data: SparkDataSet) -> None:
print(data)
return
src/project/pipelines/data_engineering/pipeline.py
def create_pipeline(**kwargs):
return Pipeline(
[
node(
func=preprocess_data,
inputs="data",
outputs="preprocessed_data",
name="preprocessing_data",
),
]
)
src/project/pipeline.py
def create_pipelines(**kwargs) -> Dict[str, Pipeline]:
de_pipeline = de.create_pipeline()
return {
"de": de_pipeline,
"__default__": Pipeline([])
}