• Home
        • Database Development

          Database development is designing, creating a database or data model, and analyzing requirements and their intents as raw data.

          Learn More
        • Architecture & Design

          Software architecture refers to the fundamental structures of a software system and the discipline of creating such structures and systems.

          Learn More
        • Programming

          Computer programming is the process of performing a particular computation or more generally, accomplishing a specific result.

          Learn More
        • Cloud Computing

          Cloud computing is the on-demand availability of computer system resources, especially data storage and computing power.

          Learn More
        • ETL Development

          ETL provides the foundation for data analytics and machine learning workstreams. Through a series of business rules, ETL cleanses and organizes data.

          Learn More
        • Data Visualization & Reports

          Data and information visualization is an interdisciplinary field that deals with the graphic representation of data and information.

          Learn More
  • Blog
  • Contact

Using SSIS to Import HTTP-Based XML Document

Introduction
The post, XML Document Sourced SSRS Report Using HTTP URL, demonstrated how a SQL Server Reporting Services (SSRS) data source can be used to reference an Extensible Markup Language (XML) document located via Hypertext Transfer Protocol (HTTP) protocol-based URL. Another approach to reporting off an XML document is to first extract the XML data using SQL Server Integration Services (SSIS), store it in a relational engine such as such Microsoft SQL Server, and then point your SSRS data sources to SQL Server.
Similar to the aforementioned post, the XML document that we will be using is the sitemap file of select SIFISO blogs which is located here.
The aforementioned document looks as shown below:
"14"
Source Code Changesets
The changesets associated with the source code for this article can be found on GitHub and CodePlex as shown below, respectively:
"9""8"
Walkthrough
Okay, let’s get started.
Add a new SSIS package and assign it a name (I have labeled mine, ImportSitemap_data)
"10"
Navigate to SSIS Toolbox in SQL Server Data Tools (SSDT) SSIS Designer. Drag and drop a data flow task into the control flow. I prefer to assign meaningful names to my SSIS components so I have renamed my data flow task to Data Flow Task – Import Sitemap data.
"1""dft"
Let’s configure the newly added data flow task.
Click on the Data Flow tab next to Control Flow. Navigate to the SSIS Toolbox window. Drag and drop an XML Source component into the data flow. Rename the XML Source component to XML Source – blog sitemap.
At this point your package should look as follows:
"12""11"
Let’s configure the XML Source component.
Under the connection manager tab, set Data access mode: property to the XML file location.
Set the XML location to http://www.blogs.selectsifiso.com/sitemap.xml.
You will soon be required to provide an XML Schema Definition file (XSD). The easiest way is to generate the XSD file using the XML location document. During the generating of the file, you will be asked for a save-file location (I saved mine in c drive).
"4"
Click on the columns tab to refresh the metadata and then click OK to commit the configuration settings.
"5"
That basically concludes the settings for extracting the XML data.
However, for the purposes of troubleshooting and ensuring that the package works, we can go on and add a Row Sampling transformation component.
Connect the XML source component to Row Sampling.
"2"
Configure the Row Sampling component as follows:
"6"
"7"
Finally, add a data viewer to take a look at the data as you extract it during runtime:
"dataviewer"
Your data flow should look as follows:
"11"
Conclusion
That’s all folks.
I hope you were able to learn something from this post.
Until next time, cheers.
CodeProject

Loading

Sifiso

October 13, 2022
Sifiso is Data Architect and Technical Lead at SELECT SIFISO – a technology consulting firm focusing on cloud migrations, data ingestion, DevOps, reporting and analytics. Sifiso has over 15 years of across private and public business sectors, helping businesses implement Microsoft, AWS and open-source technology solutions. He is the member of the Johannesburg SQL User Group and also hold a Master’s Degree in MCom IT Management from the University of Johannesburg.

Meet Our Experts

We are proud to have a team of experts who are passionate about delivering the best possible solutions to you. Our team members are highly skilled and experienced in a diverse range of IT technologies, and are committed to staying up-to-date with the latest industry trends and best practices to deliver you the best results.

0 Comments

Submit a Comment

Your email address will not be published. Required fields are marked *

This site uses Akismet to reduce spam. Learn how your comment data is processed.

Join Our Newsletter

Subscribe to get our latest and best thinking on the most definitive workforce topics affecting HR leaders and organizations today.