Integrating External Data Sources in Models
Q: How do you incorporate external data sources into data models?
- Data modeling
- Senior level question
Explore all the latest Data modeling interview questions and answers
ExploreMost Recent & up-to date
100% Actual interview focused
Create Data modeling interview for FREE!
When incorporating external data sources into data models, the first step is to identify the source of the data and ensure that it is reliable. This means understanding the provenance of the data, who created it and how it was created, and if necessary, validating the data with the source. Additionally, it is important to consider the format of the data, as this will affect how it is used when incorporating it into the data model.
Once the source and format of the data have been identified, it is necessary to determine how it will be incorporated into the existing data model. This involves mapping the data from the external source to the existing data model, which requires an understanding of the data model and the external source. This mapping should also consider any transformations that may need to be applied to the data in order to maintain data integrity.
Finally, once the data has been mapped to the existing data model, it needs to be incorporated into the data model in a way that is consistent with the other data. This may involve creating new tables, adding columns to existing tables, or updating existing data with the newly received information.
For example, imagine that a company has a data model that incorporates customer information (name, address, contact info, etc) and they receive a new dataset from a third-party vendor that includes customer purchase information (product name, product quantity, purchase date, etc). To incorporate this data into their existing data model, the data need to be mapped to the existing customer table, which may require creating a new table for the purchase information, or adding columns to the existing customer table. Once the data has been mapped and incorporated into the model, the company will then have a more complete view of their customer data.
Once the source and format of the data have been identified, it is necessary to determine how it will be incorporated into the existing data model. This involves mapping the data from the external source to the existing data model, which requires an understanding of the data model and the external source. This mapping should also consider any transformations that may need to be applied to the data in order to maintain data integrity.
Finally, once the data has been mapped to the existing data model, it needs to be incorporated into the data model in a way that is consistent with the other data. This may involve creating new tables, adding columns to existing tables, or updating existing data with the newly received information.
For example, imagine that a company has a data model that incorporates customer information (name, address, contact info, etc) and they receive a new dataset from a third-party vendor that includes customer purchase information (product name, product quantity, purchase date, etc). To incorporate this data into their existing data model, the data need to be mapped to the existing customer table, which may require creating a new table for the purchase information, or adding columns to the existing customer table. Once the data has been mapped and incorporated into the model, the company will then have a more complete view of their customer data.


