Connecting to a CSV File with Data Cube
You can use Data Cube to collect, organize, and mine the data residing in various repositories across your enterprise.
With Data Cube you can upload multiple CSV files to the same data source, either by selecting multiple files or adding the files to a ZIP folder. When adding multiple files to the same data source, the data structure in the files must be identical for the data to be crawled successfully. If the structure of the CSV files are different, then you must use a new data source to crawl the data.
Before You Begin
- You must be able to log in to the Web Console to view Data Cube. See Accessing the Web Console.
- Only users assigned a role with the Data Connectors permission at the MediaAgent level can access Data Cube in the Analytics section of the Web Console. The associated MediaAgent must have been configured with Analytics Engine for Data Cube.
- In a Web browser, log in to the Web Console and then click Analytics > CSV.
- Under Data Connectors next to CSV, click Add New.
- On the New Data Source (CSV) page, configure the source as follows:
- Click the Analytics Engine list and select an Analytics Engine.
- In Data Source Name, enter a name for the data source. The name cannot contain spaces.
- Click Next.
- To upload files from a local computer:
- Select Upload CSV File(s) and click Upload
- Select one or more local CSV files. You can also select multiple CSV files included in a ZIP folder. See Uploading Multiple CSV Files.
- Click Open.
A box containing upload progress information appears in the lower-right corner of the page.
- To upload files from a shared path or local path on the Analytics Engine selected for this data source:
- Select Specify CSV File(s) Location.
- In Folder Path, enter the local folder path or share path to the CSV file.
Note: The location of the CSV file is relative to the Analytics Engine selected for this data source.
If you entered a share path, enter the User Name and Password to access the share.
- Enter the remaining information as follows:
- Under Column Separator, select or enter the character used to separate the items in the CSV file(s).
- If the first row of the CSV file(s) contains column heading names, select First row has column name. If you uploaded multiple CSV files, then the column names of the first CSV file are added and the first row of the other CSV fiels are skipped.
- Optional: In the Columns box, enter column heading names for the CVS file(s) as comma separated values. If you selected First row has column name, the values in the Columns box will overwrite the values in the first row of the CSV file(s). If you did not select the First row has column name option, then the first row of the CSV file(s) will be added as values under your custom column names.
- Optional: Select Incremental Crawl to perform one full crawl of the data and subsequently crawl only the data that has changed from the previous crawl.
- If the CSV file has a primary key, enter the column name of the primary key in Primary Key.
Note: The primary key must contain unique entries. If two primary key entries are the same, then only the last entry will be crawled.
- Optional: Select Start Crawling Now to begin indexing the information in the CSV file immediately after you save the source.
- When finished, click Save.
The source appears on the Data Source (CSV) page.