Using tGoogleAnalyticsInput in JasperETL/Talend
This is a short article on getting Google Analytics into JaspersoftETL/Talend ETL tools. Google Analytics has a nice interface and many great ways of analyzing web traffic, it'd be hard to beat the work Google did there. The purpose of this connector would be to bring the data into your own data warehouse and join it with other intelligence you might have. A great way to display the aggregated data would be with a BI tool like Jaspersoft.
I'm writing because the help for the tGoogleAnalyticsInput is in French and incomplete.
So let's get started. You'll need JasperETL 4.1.2 or older (commercial editions here, open source here).
Installing the plugin:
Configure the component. First start with finding some fields you'd like to use from the Google API: http://code.google.com/apis/analytics/docs/gdata/dimsmets/dimsmets.html. Note that you need to have some metrics (can't be all dimensions, vice versa).
I'm writing because the help for the tGoogleAnalyticsInput is in French and incomplete.
So let's get started. You'll need JasperETL 4.1.2 or older (commercial editions here, open source here).
Installing the plugin:
Create a new Job Design and drag the tGoogleAnalyticsInput component from the Palette (under Internet):
Configure the component. First start with finding some fields you'd like to use from the Google API: http://code.google.com/apis/analytics/docs/gdata/dimsmets/dimsmets.html. Note that you need to have some metrics (can't be all dimensions, vice versa).
Here's an example schema (you'll have to put in the fields yourself):
Now configure the component. In order to get your Profile ID (note it doesn't start with UA) read the cryptic Google Documentation about it. I got mine by examining the URL while logged into GA and looking at the &id= parameter.
The "value" column comes directly from the Google API and will also have to be filled in. I've also found that I HAVE to filter on period. You can do 1980-2020 if you want to include everything.
The "value" column comes directly from the Google API and will also have to be filled in. I've also found that I HAVE to filter on period. You can do 1980-2020 if you want to include everything.
Funny enough. If you're reading this right now then you will eventually flow through the GA database and the very job I'm displaying above ;)
Comments
So to get yesterday and today:
End Date (today):
TalendDate.getDate("yyyy-MM-dd")
Start Date(tomorrow):
TalendDate.addDate(TalendDate.getDate("yyyy-MM-dd"),"yyyy-MM-dd",-1,"dd")
* -1 is the field that I added
I tried giving all the required parms
and the dates are giving error, MM can not be resolved to a variable.Any clue? The date I gave is "2012-04-11" and "2013-07-10".What is wrong here?
Anitha.