By Timo Selvaraj
Managing your text content within databases both SQL and NoSQL based, csv, excel, emails, twitter streams, forms, documents and custom apps can be challenging especially when performing ETL (Extraction, Transformation and Loading) operations. Unstructured text within any of these data sources need to be transformed for the required end user operations including removal of stop words, identification of entities, clustering of themes and filtering or sorting based on terms. SearchBlox is an excellent tool to manage multiple data sources with both text, date and number based content given the abilities to normalize the data from many sources out of the box as well as provide a common filtering and sorting mechanism to find the right data.
Extract specific columns from multiple databases, specific columns from csv flat files, meta data or text content from webpages or rss feeds, social streams from twitter from cloud or on-premise data sources.
Transform the varied data sources to the common format that is required by your business for fast access or input into another system or app. Specify the fields SearchBlox can use or create your own custom fields easily.
Load on-demand, continuously or on a schedule once the collections are setup for the different data or content sources.
Read our recent Blog posts
4870 Sadler Road Suite 300
Glen Allen, VA 23060
Phone: (866) 933-3626