Skip to main content

Questions tagged [etl]

Extract, Transform, Load - process in a database

-1 votes
2 answers
194 views

There is a kinda ETL task of importing data from csv to the database in project with legacy codebase and legacy database**. Data should be validated before persisting to database. Validation includes ...
Rui's user avatar
  • 1,935
-1 votes
1 answer
228 views

I have a wide CSV file of about 350mb, and want to load it into a SQL database and properly model the data to make it easier to use for analysis. I could split the data into tables with python and ...
HappilyCoding's user avatar
1 vote
1 answer
234 views

I need to build a data pipeline to populate a database from various files. This is a common scenario. However, I want to have expert opinions for implementing a pipeline that is robust, modular and ...
Imtiaz's user avatar
  • 23
1 vote
1 answer
528 views

In our project we are using Django and Django Rest Framework as main application to get/query the data from database and send it to the frontend. Those endpoints are very fast as they should be. ...
Alex T's user avatar
  • 161
0 votes
1 answer
88 views

I have a situation where let's say I have a folder called logs which has N folders. Each folder contains events for a specific event type and each folder has N .log files where each file has multiple ...
Sriram R's user avatar
-2 votes
2 answers
491 views

I have thousands of .csv files with the same structure and, in most of the cases, some column values are the same ones recurring. Each file represents a report on some structures, with numeric ...
BoardsOfConsulting's user avatar
-1 votes
1 answer
37 views

I'm developing an ETL process in Python and Pandas to pull data from a rest API, and then dump it into a relational database. A few of the fields that come back contain sensitive that I do not want to ...
einkleindatagal's user avatar
0 votes
1 answer
53 views

I am looking for any general guidelines to allocate table space quota to different layers/schemas in ETL flow of a data warehouse (% of total space in each layer). As per my research, ETL flow can ...
Curious_Mind's user avatar

15 30 50 per page