<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>Daniel Koller</title>
	<atom:link href="http://blog.dakoller.net/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.dakoller.net</link>
	<description>Data Science &#38; Big Data, Startups und der Rest</description>
	<lastBuildDate>Sun, 13 Jan 2013 14:26:03 +0000</lastBuildDate>
	<language>de</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='blog.dakoller.net' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://0.gravatar.com/blavatar/e5f778ecfa0067f057ec4e94950e6093?s=96&#038;d=http%3A%2F%2Fs2.wp.com%2Fi%2Fbuttonw-com.png</url>
		<title>Daniel Koller</title>
		<link>http://blog.dakoller.net</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://blog.dakoller.net/osd.xml" title="Daniel Koller" />
	<atom:link rel='hub' href='http://blog.dakoller.net/?pushpress=hub'/>
		<item>
		<title>Using django-social-auth with the XING api &#8230; oAuth in action!</title>
		<link>http://blog.dakoller.net/2012/10/31/using-django-social-auth-with-the-xing-api-oauth-in-action/</link>
		<comments>http://blog.dakoller.net/2012/10/31/using-django-social-auth-with-the-xing-api-oauth-in-action/#comments</comments>
		<pubDate>Wed, 31 Oct 2012 16:36:47 +0000</pubDate>
		<dc:creator>dakoller</dc:creator>
				<category><![CDATA[data science]]></category>
		<category><![CDATA[rapidslides]]></category>
		<category><![CDATA[startup]]></category>
		<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[django]]></category>
		<category><![CDATA[linkedin]]></category>
		<category><![CDATA[oauth]]></category>
		<category><![CDATA[python]]></category>
		<category><![CDATA[xing]]></category>

		<guid isPermaLink="false">http://blog.dakoller.net/?p=264</guid>
		<description><![CDATA[This blog is about how to connect to the XING API via oAuth: my use case was to implement a &#8220;Login with XING&#8221; option in a web application. XING is the european pendant to LinkedIn as social network for professionals, so there are bunch of people, who are using these two networks in order to [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.dakoller.net&#038;blog=33883038&#038;post=264&#038;subd=dakoller&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>This blog is about how to connect to the XING API via oAuth: my use case was to implement a &#8220;Login with XING&#8221; option in a web application.</p>
<p>XING is the european pendant to LinkedIn as social network for professionals, so there are bunch of people, who are using these two networks in order to maintain their connections ( <a href="http://datenprodukt.dakoller.net/products/">datenprodukt.dakoller.net</a> might provide a helper for this challenge ) . Luckily for me, XING started with an API in closed beta state (signup at <a href="https://dev.xing.com/">https://dev.xing.com/</a>).</p>
<p>My setup is a Django-based web application hosted on Heroku, and I am using the last stable version 1.4.1.</p>
<p>This blog is about what I needed to integrate with the app, how I adapted the connection and how I can use it now.<span id="more-264"></span></p>
<p><a href="https://github.com/omab/django-social-auth">Django-social-auth</a> is  well <a href="http://django-social-auth.readthedocs.org/en/latest/index.html">documented</a> and has already today a broad list of other<a href="https://github.com/omab/django-social-auth#id3"> web applications</a>, with which you can connect quite fast. The really big advantage in terms of software architecture is, that it uses the already good <a href="https://docs.djangoproject.com/en/dev/topics/auth/">user administration &amp; auth handling</a>, which is already inbuilt in Django and provides a complete abstraction from the provider specific settings, when you are inside your app.</p>
<p>You can now plugin any kind of oAuth enabled web application, without specific auth handling in the application. (However you still have to do the specific API calls after you got the access tokens.)</p>
<p>The needed packages are:</p>
<pre>Django==1.4.1
django-social-auth==0.7.6
httplib2==0.7.6
oauth2==1.5.211</pre>
<p>Dependencies of these packages are installed using the pip-installer system in Python.</p>
<p>The main piece is a custom module, which talks the oAuth process needed by XING. The code is also available on <a href="https://github.com/dakoller/django-social-auth4xing">GitHub</a>.</p>
<pre>"""
XING OAuth support

No extra configurations are needed to make this work.
"""
from xml.etree import ElementTree
from xml.parsers.expat import ExpatError

from oauth2 import Token
import oauth2 as oauth

from social_auth.utils import setting
from social_auth.backends import ConsumerBasedOAuth, OAuthBackend, USERNAME
from social_auth.backends.exceptions import AuthCanceled, AuthUnknownError

from pprint import pprint

import settings
import simplejson as json

XING_SERVER = 'xing.com'
XING_REQUEST_TOKEN_URL = 'https://api.%s/v1/request_token' % \
                                    XING_SERVER
XING_ACCESS_TOKEN_URL = 'https://api.%s/v1/access_token' % \
                                    XING_SERVER
XING_AUTHORIZATION_URL = 'https://www.%s/v1/authorize' % \
                                    XING_SERVER
#XING_CHECK_AUTH = 'https://api.%s/v1/users/me.json' % XING_SERVER
XING_CHECK_AUTH = 'https://api.%s/v1/users/me.xml' % XING_SERVER

class XingBackend(OAuthBackend):
    """Xing OAuth authentication backend"""
    name = 'xing'
    EXTRA_DATA = [('id', 'id'),('user_id','user_id')]

    def get_user_details(self, response):
        """Return user details from Xing account"""
        pprint(response)
        first_name, last_name = response['first_name'], response['last_name']
        #first_name='Alfred E.'
        #last_name='Neumann'
        email = response.get('email', '')
        return {USERNAME: first_name + last_name,
                'fullname': first_name + ' ' + last_name,
                'first_name': first_name,
                'last_name': last_name,
                'email': email}

class XingAuth(ConsumerBasedOAuth):
    """Xing OAuth authentication mechanism"""
    AUTHORIZATION_URL = XING_AUTHORIZATION_URL
    REQUEST_TOKEN_URL = XING_REQUEST_TOKEN_URL
    ACCESS_TOKEN_URL = XING_ACCESS_TOKEN_URL
    SERVER_URL = 'api.%s' % XING_SERVER
    AUTH_BACKEND = XingBackend
    SETTINGS_KEY_NAME = 'XING_CONSUMER_KEY'
    SETTINGS_SECRET_NAME = 'XING_CONSUMER_SECRET'
    SCOPE_VAR_NAME=None
    SCOPE_SEPARATOR = '+'

    def user_data(self, access_token, *args, **kwargs):
        """Return user data provided"""
        url = XING_CHECK_AUTH

        consumer = oauth.Consumer(key=settings.XING_CONSUMER_KEY, secret=settings.XING_CONSUMER_SECRET)
        client= oauth.Client(consumer,access_token)

        resp, content = client.request('https://%s%s' % ('api.xing.com','/v1/users/me.json'), "GET")
        profile= json.loads(content)['users'][0]
        #pprint(profile)

        try:
            #return to_dict(ElementTree.fromstring(raw_xml))
            return {'user_id':profile['id'],'id':profile['id'],'first_name': profile['first_name'],'last_name': profile['last_name'],'email': profile['active_email']}
        except (ExpatError, KeyError, IndexError):
            return None

    def auth_complete(self, *args, **kwargs):
        """Complete auth process. Check Xing error response."""
        oauth_problem = self.request.GET.get('oauth_problem')
        if oauth_problem:
            if oauth_problem == 'user_refused':
                raise AuthCanceled(self, '')
            else:
                raise AuthUnknownError(self, 'Xing error was %s' %
                                                    oauth_problem)
        return super(XingAuth, self).auth_complete(*args, **kwargs)

    def get_scope(self):
        """Return list with needed access scope"""
        scope = []
        if self.SCOPE_VAR_NAME:
            scope = setting(self.SCOPE_VAR_NAME, [])
        else:
            scope = []
        return scope

    def unauthorized_token(self):
        """Makes first request to oauth. Returns an unauthorized Token."""
        request_token_url = self.REQUEST_TOKEN_URL
        scope = self.get_scope()
        if scope:
            qs = 'scope=' + self.SCOPE_SEPARATOR.join(scope)
            request_token_url = request_token_url + '?' + qs

        request = self.oauth_request(
            token=None,
            url=request_token_url,
            extra_params=self.request_token_extra_arguments()
        )
        response = self.fetch_response(request)
        return Token.from_string(response)

def to_dict(xml):
    """Convert XML structure to dict recursively, repeated keys entries
    are returned as in list containers."""
    children = xml.getchildren()
    if not children:
        return xml.text
    else:
        out = {}
        for node in xml.getchildren():
            if node.tag in out:
                if not isinstance(out[node.tag], list):
                    out[node.tag] = [out[node.tag]]
                out[node.tag].append(to_dict(node))
            else:
                out[node.tag] = to_dict(node)
        return out

# Backend definition
BACKENDS = {
    'xing': XingAuth,
}</pre>
<p>Integration requires to put the oAuth consumer tokens into settings.py:</p>
<pre># get these data from dev.xing.com
XING_CONSUMER_KEY = "xxx"
XING_CONSUMER_SECRET = "yyy"
XING_BASE_URL='https://api.xing.com'</pre>
<p>Additionally you need to tell django-social-auth, that you want to use an additional auth module (for Xing) in your application.</p>
<pre>AUTHENTICATION_BACKENDS = (
    'nwg.xing.XingBackend',
    'django.contrib.auth.backends.ModelBackend',
)</pre>
<p>The effect of this integration is now as follows:</p>
<ul>
<li>people can login to the application with &#8216;/login/xing&#8217;,</li>
<li>existing users of the application can associate their Xing accoutn by calling /associate/xing and</li>
<li>the application get the oAuth access tokens (for accessing data &amp; initiating actions) by querying the UserSocialAuth objects.</li>
</ul>
<p>Overall integration dajngo-social-auth with Xing went quite smooth, special kudos goes to the <a href="https://github.com/omab">creator of the package, Matias Aguirre</a>, for the effort to setup the package and for the support in my special case.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/dakoller.wordpress.com/264/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/dakoller.wordpress.com/264/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.dakoller.net&#038;blog=33883038&#038;post=264&#038;subd=dakoller&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://blog.dakoller.net/2012/10/31/using-django-social-auth-with-the-xing-api-oauth-in-action/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/ee0740804be17ca85d68e2cafd4fb989?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">dakoller</media:title>
		</media:content>
	</item>
		<item>
		<title>Now also with blogs/lessons learned from RapidSlides.com</title>
		<link>http://blog.dakoller.net/2012/08/26/now-also-with-blogslessons-learned-from-rapidslides-com/</link>
		<comments>http://blog.dakoller.net/2012/08/26/now-also-with-blogslessons-learned-from-rapidslides-com/#comments</comments>
		<pubDate>Sun, 26 Aug 2012 10:03:42 +0000</pubDate>
		<dc:creator>dakoller</dc:creator>
				<category><![CDATA[rapidslides]]></category>
		<category><![CDATA[startup]]></category>

		<guid isPermaLink="false">http://blog.dakoller.net/?p=261</guid>
		<description><![CDATA[As you might be aware of, I am currently working on a startup called RapidSlides.com. RapidSlides creates 80% ready business presentations based on visual input. The input comes from photos of whiteboards / flip charts or hand-drawn sketches from your iPad. We use your layout or create one based on your website and deliver our [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.dakoller.net&#038;blog=33883038&#038;post=261&#038;subd=dakoller&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>As you might be aware of, I am currently working on a startup called <a href="http://rapidslides.com/">RapidSlides.com</a>.</p>
<p>RapidSlides creates 80% ready business presentations based on visual input. The input comes from photos of whiteboards / flip charts or hand-drawn sketches from your iPad. We use your layout or create one based on your website and deliver our results back to in max. 24  hours.</p>
<p>You want to see how it looks like? Just head over to <a href="http://rapidslides.com/de/beispiele">http://rapidslides.com/de/beispiele</a> , where you can see first examples from customer orders.</p>
<p><a href="http://rapidslides.com/de/beispiele"><img class="alignnone" src="http://rapidslides.com/de/wp-content/uploads/2012/07/Cover_TestDrivenDevelopment_thumb.png" alt="" width="346" height="213" /></a></p>
<p>&nbsp;</p>
<p>You can also participate in the currently running test phase: just sign up at <a href="http://rapidslides.com/">http://rapidslides.com/</a> .</p>
<p>We (that means  <a href="https://www.xing.com/profile/Ralf_Westbrock">Ralf Westbrock</a> from <a href="http://str84wd.com/">http://str84wd.com/</a> and me ) are operating from Munich Germany and follow the <a href="http://theleanstartup.com/">Lean Startup</a> methodolody.</p>
<p>My role there is mainly the technical part and i&#8217;ll blog here about my lessons learned of the project. Do you have already specific aspects in which you are interested? Just comment here or reach out on Twitter at <a href="http://twitter.com/dakoller">@dakoller</a> .</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/dakoller.wordpress.com/261/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/dakoller.wordpress.com/261/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.dakoller.net&#038;blog=33883038&#038;post=261&#038;subd=dakoller&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://blog.dakoller.net/2012/08/26/now-also-with-blogslessons-learned-from-rapidslides-com/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/ee0740804be17ca85d68e2cafd4fb989?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">dakoller</media:title>
		</media:content>

		<media:content url="http://rapidslides.com/de/wp-content/uploads/2012/07/Cover_TestDrivenDevelopment_thumb.png" medium="image" />
	</item>
		<item>
		<title>REGISTRATION STARTED &#8211; 2. Data Science Day, August 22, Berlin</title>
		<link>http://blog.dakoller.net/2012/07/16/registration-started-2-data-science-day-august-22-berlin/</link>
		<comments>http://blog.dakoller.net/2012/07/16/registration-started-2-data-science-day-august-22-berlin/#comments</comments>
		<pubDate>Mon, 16 Jul 2012 08:45:22 +0000</pubDate>
		<dc:creator>dakoller</dc:creator>
				<category><![CDATA[data science]]></category>
		<category><![CDATA[dsday]]></category>

		<guid isPermaLink="false">http://blog.dakoller.net/?p=255</guid>
		<description><![CDATA[REGISTRATION STARTED &#8211; 2. Data Science Day, August 22, Berlin. 1. Data Science Day was a great event &#8230; so the next upcoming one is worth a visit too. The focus topic will be Game Analytics. I heard rumors about an idea to got to Munich for one of the next Data Science Days&#8230; do [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.dakoller.net&#038;blog=33883038&#038;post=255&#038;subd=dakoller&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://wp.me/p2vLZw-1T">REGISTRATION STARTED &#8211; 2. Data Science Day, August 22, Berlin</a>.</p>
<p>1. Data Science Day was a great event &#8230; so the next upcoming one is worth a visit too. The focus topic will be Game Analytics.</p>
<p>I heard rumors about an idea to got to Munich for one of the next Data Science Days&#8230; do you have any feedback ? Any special focus topics you think of?</p>
<p>&nbsp;</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/dakoller.wordpress.com/255/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/dakoller.wordpress.com/255/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.dakoller.net&#038;blog=33883038&#038;post=255&#038;subd=dakoller&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://blog.dakoller.net/2012/07/16/registration-started-2-data-science-day-august-22-berlin/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/ee0740804be17ca85d68e2cafd4fb989?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">dakoller</media:title>
		</media:content>
	</item>
		<item>
		<title>Use social networks to serve more relevant people better (Business Ideas)</title>
		<link>http://blog.dakoller.net/2012/06/13/use-social-networks-to-serve-more-relevant-people-better-business-ideas/</link>
		<comments>http://blog.dakoller.net/2012/06/13/use-social-networks-to-serve-more-relevant-people-better-business-ideas/#comments</comments>
		<pubDate>Wed, 13 Jun 2012 06:19:55 +0000</pubDate>
		<dc:creator>dakoller</dc:creator>
				<category><![CDATA[business ideas]]></category>
		<category><![CDATA[crm]]></category>
		<category><![CDATA[influence]]></category>
		<category><![CDATA[klout]]></category>
		<category><![CDATA[peerindex]]></category>
		<category><![CDATA[social media]]></category>

		<guid isPermaLink="false">https://dakoller.wordpress.com/?p=249</guid>
		<description><![CDATA[In the business ideas category I post potential business ideas, which I cannot work on the moment, but which may be good input to other people. You might have seen the nice pic on Facebook, which said &#8220;Please fix my phone or I will tell my X-thousands of followers about your bad performance&#8217;&#8221;. This sounds [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.dakoller.net&#038;blog=33883038&#038;post=249&#038;subd=dakoller&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>In the business ideas category I post potential business ideas, which I cannot work on the moment, but which may be good input to other people.</p>
<hr />
<p>You might have seen the nice pic on Facebook, which said<br />
&#8220;Please fix my phone or I will tell my X-thousands of followers about your bad performance&#8217;&#8221;.</p>
<p>This sounds like a joke at the moment, but in terms of customer relationship management you might fear the bad sentiment about your company, which is conveyed to a (potentially) very large group of people by messages like this one. You also might have heard already about the very positive impact on your brand, when you react on customer feedback in social online media in a very constructive, friendly &amp; timely manner.</p>
<p>So not only in case of customer service tasks, but also for customer acquisition tasks you are interested in making influential people to talk about your product or service.</p>
<p>But how would the company know who is important/influential on social networks and in which peer groups their voice is heard best? There is a solution for that!</p>
<p>So the idea is:</p>
<p><strong>Value Proposition:</strong><br />
<strong>Enable companies to put  special attention on influential people on social networks (among their customer &amp; prospects).</strong></p>
<p>This enables them to e.g.</p>
<ul>
<li>distribute limited special goodies, (such as  beta invites, coupons etc.) to the people influential in their domain,</li>
<li>prioritize them in customer service applications or</li>
<li>approach them for feedback on your service.</li>
</ul>
<div>You might even think of offering special treatment, when an influential person enters a chatbox on your website. (a kind of learning for me from <a href="http://blog.dakoller.net/2012/06/13/1-data-science-day-in-deutschland-ein-kurzer-review/">1. german data Science Day &#8211; a (german) review</a> is available)</div>
<p><strong>Solution:</strong><br />
<strong>Match their customer information with data sources on influential people, e.g. based on the services of  <a href="http://klout.com">Klout</a> or <a href="http://www.peerindex.com/">PeerIndex</a>.</strong></p>
<p>(yes: there is sometimes bad sentiment about services like Klout, as they reduce a persons relevance to a single number &#8211; and they don&#8217;t tell you in detail, how they are doing it. This bad sentiment goes as far as motivating people to setup fake services such as <a href="http://flout.com">flout.com</a>. I think these services learned from this feedback, e.g. to tell you also on which topics someone is influential.)</p>
<p>Many of these service, including the two mentioned ones, make their information available as APIs, so one could link this information with customer information.</p>
<p>This could go as far as: &#8220;Distribute 100 books coupons to the most important people in the area of <a href="http://en.wikipedia.org/wiki/Rosamunde_Pilcher">&#8216;Rosamunde Pilcher&#8217; literature</a> in my customer list.&#8221;</p>
<p><strong>Secret sauce: </strong>(important to not get immediately a victim of other people copying your service)</p>
<p>Companies wouldn&#8217;t want to do this kind of data wrangling from APIs and the merging with other data sources on their own. They likely also don&#8217;t want to invest the infrastructure, which is needed the keep this data current and to hint the company to relevant changes over time.</p>
<p>However, there is a caveat: you need a good story (or a list of precautions to show the companies, that you work with their data in a secure way)</p>
<p>&nbsp;</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/dakoller.wordpress.com/249/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/dakoller.wordpress.com/249/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.dakoller.net&#038;blog=33883038&#038;post=249&#038;subd=dakoller&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://blog.dakoller.net/2012/06/13/use-social-networks-to-serve-more-relevant-people-better-business-ideas/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/ee0740804be17ca85d68e2cafd4fb989?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">dakoller</media:title>
		</media:content>
	</item>
		<item>
		<title>1. Data Science Day in Deutschland: ein kurzer Review</title>
		<link>http://blog.dakoller.net/2012/06/13/1-data-science-day-in-deutschland-ein-kurzer-review/</link>
		<comments>http://blog.dakoller.net/2012/06/13/1-data-science-day-in-deutschland-ein-kurzer-review/#comments</comments>
		<pubDate>Wed, 13 Jun 2012 06:06:23 +0000</pubDate>
		<dc:creator>dakoller</dc:creator>
				<category><![CDATA[data science]]></category>
		<category><![CDATA[datascience]]></category>
		<category><![CDATA[dsc]]></category>
		<category><![CDATA[dsday]]></category>

		<guid isPermaLink="false">https://dakoller.wordpress.com/?p=247</guid>
		<description><![CDATA[Am 6.6. fand in Berlin der erste Data Science Day in Deutschland statt: organisiert hat ihn Klaas Bollhöfer (@klabol)&#8230; aber das war nicht die erste Aktion zum Thema: der Data Science Day war aber der erste Höhepunkt einer aktiven Communityarbeit (auf XING) seit Januar , die die dort mittlerweile 92 Mitglieder einbindet. (Ein gutes Zeichen [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.dakoller.net&#038;blog=33883038&#038;post=247&#038;subd=dakoller&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Am 6.6. fand in Berlin der <strong>erste Data Science Day in Deutschland</strong> statt: organisiert hat ihn Klaas Bollhöfer (<a href="http://twitter.com/#!/klabol">@klabol</a>)&#8230; aber das war nicht die erste Aktion zum Thema: der Data Science Day war aber der erste Höhepunkt einer<a href="https://www.xing.com/net/pri142310x/datascience/"> aktiven Communityarbeit (auf XING)</a> seit Januar , die die dort mittlerweile 92 Mitglieder einbindet.</p>
<p>(Ein gutes Zeichen ist auch, daß mittlerweile auch deutsche Unternehmen die Relevanz offensichtlich verstanden haben, was man daran sehen kann, dass sie Mitarbeiter dort teilnehmen liessen und das Event auch gesponsert haben, z. B.<a href="http://twitter.com/#!/immobilienscout"> @Immobilienscout</a>, <a href="http://twitter.com/#!/wooga">@Wooga</a>)</p>
<p>Bedeutend ist, daß wir letzte Woche an einem Platz 70 Gäste da hatten, die den ziemlich breiten Raum des Begriffs Data Science ziemlich komplett ausgeleuchtet haben. (Vorher hatte ich hier manchmal das Gefühl, in einem sehr exotischen Feld zu arbeiten)</p>
<p>Die Bandbreite der Themen ging von<br />
- Frameworks und Tools (z.B. Hadoop, Twitter Storm),<br />
- Handling von strukturierter und unstrukturierter Information (z.B. bessere Suche, Inhaltsanalyse und Zusammenfassungen von Text mit den interessanten Übergängen ins Semantic Web) über<br />
- die Relevanz von Open Data (als Basis für öffentliche Informationsangebote und aufrüttelnde Unterstützung für Kampagnen () bis hin zu<br />
- Visualisierungen.(z.B. auf Basis von Processing).</p>
<p>Der Vormittag war gefüllt von gesetzten Präsentationen ( Agenda unter <a href="http://de.amiando.com/datascienceday.html">http://de.amiando.com/datascienceday.html</a>)  : gut war, daß Klaas die so kurz halten konnte, daß viel Raum für Diskussionen blieb. Am Nachmittag ging es drei parallel laufenden und kleineren Gruppen um<br />
- sehr technische Fragen, (&#8220;was macht man mit Twitter Storm?&#8221;),<br />
- visionärere Themen (&#8220;was kann man noch mit XYZ machen?&#8221;) und<br />
- eine Gruppe zum Thema Texthandling und Open Data-Nutzung.</p>
<p>Aus diesem Tag habe ich folgende Beobachtungen mitgenommen<br />
-<strong> die meiste Relevanz</strong> hat das Thema <strong>Data Science</strong> im Moment <strong>wohl im Advertising-Markt</strong>: da geht es häufig, um sehr schnelle Antworten, die nicht irgendwann zu beantworten sind, sondern genau dann wenn der Webseiten-Besucher auf meiner Seite ist. (Auch wenn es für meinen &#8216;Traum&#8217; von für mich sehr relevanter, weil auf meine Interessen zugeschnittene, Werbung wohl noch zu früh ist.)<br />
- Kurz danach kommt das Thema von <strong>unstrukturierter Information</strong>: hier fällt mir auf, daß z.B. im Vergleich zu den USA (wo ich im Februar auf der StrataConf war) das Thema hier wichtiger zu sein scheint.<br />
- Im Moment nutzen hauptsächlich<strong> junge und eher kleinere Unternehmen</strong> Kompetenz zu dem Thema Big Data/Data Science: in etablierten &amp; größeren Unternehmen, die zwar die Daten hätten (die sich ein Startup ziemlich mühsam sehr beschaffen oder erzeugen muß), ist das Thema wohl noch nicht angekommen.<br />
- Aus der Perspektive eines Startupgründers ist klar, daß a<strong>us eigenen Aktionen gewonnene Daten und Analysen essentiell</strong> (oder zumindest wichtig) <strong>für die Geschäftsentwicklung</strong> sind. Ein gutes Beispiel dafür sind die Münchner <a href="http://twitter.com/#!/10stamps">@10stamps</a> (kurz eine App digitale Stempelkarten), die vermutlich mittlerweile mehr Überblick über meinen Kaffeekonsum als ich selbst haben.<br />
- Das Thema <strong>Tools für BigData</strong> ist auch noch ein guter Platz, auf dem sich <strong>neue Startups</strong> ausprobieren können: hier haben für mich Ansätze die Nase vorne, die einem Analysten/Geschäftsentwickler die Tools selbst in die Hand geben, neue Einsichten zu gewinnen. (anstatt für jede neue Fragestellung einen Entwickler suchen zu müssen). Der nächste Entwicklungsschritt sind Tools, die dem Nutzer nicht mehr Zahlenwüsten (z.B. in Powerpoint-Form) vorsetzen, sondern (mutmasslich) neue Einsichten (wie Veränderungen in Mustern &amp; Trends) direkt in die Hand geben.</p>
<p>Zusammengefasst: ein Tag voll von neuem Information und Diskussionen: der nächste Data Science Day findet in 3-4 Monaten statt und wird sich schwerpunktmässig mit dem Thema <strong>Analytics in Online-Games</strong> befassen.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/dakoller.wordpress.com/247/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/dakoller.wordpress.com/247/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.dakoller.net&#038;blog=33883038&#038;post=247&#038;subd=dakoller&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://blog.dakoller.net/2012/06/13/1-data-science-day-in-deutschland-ein-kurzer-review/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/ee0740804be17ca85d68e2cafd4fb989?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">dakoller</media:title>
		</media:content>
	</item>
		<item>
		<title>The R-Podcast Episode 6: Importing Data from External Sources</title>
		<link>http://blog.dakoller.net/2012/05/01/the-r-podcast-episode-6-importing-data-from-external-sources/</link>
		<comments>http://blog.dakoller.net/2012/05/01/the-r-podcast-episode-6-importing-data-from-external-sources/#comments</comments>
		<pubDate>Tue, 01 May 2012 06:55:48 +0000</pubDate>
		<dc:creator>dakoller</dc:creator>
				<category><![CDATA[data science]]></category>

		<guid isPermaLink="false">http://blog.dakoller.net/?p=244</guid>
		<description><![CDATA[R-bloggers viaThe R-Podcast Episode 6: Importing Data from External Sources. &#8230;is an excellent summary of how to get external data into R. I recently needed to get JSON-formatted data in R (as people argue that you should use JSON for most of the data transaction needs in the web area, even for e.g. log file [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.dakoller.net&#038;blog=33883038&#038;post=244&#038;subd=dakoller&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>R-bloggers</p>
<p>via<a href="http://www.r-bloggers.com/the-r-podcast-episode-6-importing-data-from-external-sources/">The R-Podcast Episode 6: Importing Data from External Sources</a>. &#8230;is an excellent summary of how to get external data into R.</p>
<p>I recently needed to get JSON-formatted data in R (as people argue that you should use JSON for most of the data transaction needs in the web area, even for e.g. log file content like in <a href="http://blog.treasure-data.com/post/21881575472/log-everything-as-json-make-your-life-easier " rel="nofollow">http://blog.treasure-data.com/post/21881575472/log-everything-as-json-make-your-life-easier </a>), where I found http://stackoverflow.com/questions/2617600/importing-data-from-a-json-file-into-r handy.</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/dakoller.wordpress.com/244/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/dakoller.wordpress.com/244/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.dakoller.net&#038;blog=33883038&#038;post=244&#038;subd=dakoller&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://blog.dakoller.net/2012/05/01/the-r-podcast-episode-6-importing-data-from-external-sources/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/ee0740804be17ca85d68e2cafd4fb989?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">dakoller</media:title>
		</media:content>
	</item>
		<item>
		<title>Machine learning for identification of cars</title>
		<link>http://blog.dakoller.net/2012/04/23/machine-learning-for-identification-of-cars/</link>
		<comments>http://blog.dakoller.net/2012/04/23/machine-learning-for-identification-of-cars/#comments</comments>
		<pubDate>Mon, 23 Apr 2012 11:28:03 +0000</pubDate>
		<dc:creator>dakoller</dc:creator>
				<category><![CDATA[data science]]></category>
		<category><![CDATA[R]]></category>

		<guid isPermaLink="false">http://blog.dakoller.net/?p=242</guid>
		<description><![CDATA[This is a handy getting started guide for computer vision using R from e.g. surveillance cameras, as all the with R-bloggers: it contains the needed source code. Machine learning for identification of cars.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.dakoller.net&#038;blog=33883038&#038;post=242&#038;subd=dakoller&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>This is a handy getting started guide for computer vision using R from e.g. surveillance cameras, as all the with R-bloggers: it contains the needed source code.</p>
<p><a href="http://www.r-bloggers.com/machine-learning-for-identification-of-cars/">Machine learning for identification of cars</a>.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/dakoller.wordpress.com/242/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/dakoller.wordpress.com/242/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.dakoller.net&#038;blog=33883038&#038;post=242&#038;subd=dakoller&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://blog.dakoller.net/2012/04/23/machine-learning-for-identification-of-cars/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/ee0740804be17ca85d68e2cafd4fb989?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">dakoller</media:title>
		</media:content>
	</item>
		<item>
		<title>&#8230;just answered: Where are the Semantic web incubators? Any thoughts on building an economic ecosystem for Semantic web to keep momentum enough to attract si</title>
		<link>http://blog.dakoller.net/2012/04/23/just-answered-where-are-the-semantic-web-incubators-any-thoughts-on-building-an-economic-ecosystem-for-semantic-web-to-keep-momentum-enough-to-attract-si/</link>
		<comments>http://blog.dakoller.net/2012/04/23/just-answered-where-are-the-semantic-web-incubators-any-thoughts-on-building-an-economic-ecosystem-for-semantic-web-to-keep-momentum-enough-to-attract-si/#comments</comments>
		<pubDate>Mon, 23 Apr 2012 10:58:22 +0000</pubDate>
		<dc:creator>dakoller</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">https://dakoller.wordpress.com/?p=240</guid>
		<description><![CDATA[My basic message is: as long as your startup wants to use semantic tools/infrastructures (vs. providing tools and infrastructures) you likely not need a specific incubator, as semantic web just influences the tech part of your startup. Semantic&#160;Web: Where are the Semantic web incubators? Any thoughts on building an economic ecosystem for Semantic web to [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.dakoller.net&#038;blog=33883038&#038;post=240&#038;subd=dakoller&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>My basic message is: as long as your startup wants to use semantic tools/infrastructures (vs. providing tools and infrastructures) you likely not need a specific incubator, as semantic web just influences the tech part of your startup.</p>
<p>Semantic&nbsp;Web: Where are the Semantic web incubators? Any thoughts on building an economic ecosystem for Semantic web to keep momentum enough to attract sizable investment? 1 answer on Quora</p>
<p><span class="qlink_container"><a href="http://www.quora.com/Semantic-Web/Where-are-the-Semantic-web-incubators-Any-thoughts-on-building-an-economic-ecosystem-for-Semantic-web-to-keep-momentum-enough-to-attract-sizable-investment">Where are the Semantic web incubators? Any thoughts on building an economic ecosystem for Semantic web to keep momentum enough to attract sizable investment?</a></span></p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/dakoller.wordpress.com/240/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/dakoller.wordpress.com/240/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.dakoller.net&#038;blog=33883038&#038;post=240&#038;subd=dakoller&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://blog.dakoller.net/2012/04/23/just-answered-where-are-the-semantic-web-incubators-any-thoughts-on-building-an-economic-ecosystem-for-semantic-web-to-keep-momentum-enough-to-attract-si/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/ee0740804be17ca85d68e2cafd4fb989?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">dakoller</media:title>
		</media:content>
	</item>
		<item>
		<title>How to work with Google n-gram data sets in R using MySQL</title>
		<link>http://blog.dakoller.net/2012/04/12/how-to-work-with-google-n-gram-data-sets-in-r-using-mysql/</link>
		<comments>http://blog.dakoller.net/2012/04/12/how-to-work-with-google-n-gram-data-sets-in-r-using-mysql/#comments</comments>
		<pubDate>Thu, 12 Apr 2012 17:04:08 +0000</pubDate>
		<dc:creator>dakoller</dc:creator>
				<category><![CDATA[data science]]></category>
		<category><![CDATA[google]]></category>

		<guid isPermaLink="false">http://blog.dakoller.net/?p=234</guid>
		<description><![CDATA[How to work with Google n-gram data sets in R using MySQL. via R-Bloggers: I like really much about this blog the focus interesting things along with code examples to try it out on your own. N-Grams datasets can also be created from your own texts using NTLK functions (see http://nltk.googlecode.com/svn/trunk/doc/howto/collocations.html ): in analytical use cases N-grams [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.dakoller.net&#038;blog=33883038&#038;post=234&#038;subd=dakoller&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://www.r-bloggers.com/how-to-work-with-google-n-gram-data-sets-in-r-using-mysql/">How to work with Google n-gram data sets in R using MySQL</a>. via R-Bloggers: I like really much about this blog the focus interesting things along with code examples to try it out on your own.</p>
<p>N-Grams datasets can also be created from your own texts using NTLK functions (see <a href="http://nltk.googlecode.com/svn/trunk/doc/howto/collocations.html">http://nltk.googlecode.com/svn/trunk/doc/howto/collocations.html</a> ): in analytical use cases N-grams give you a better basis to have a machine &#8216;understand&#8217; the meaning of a text (compared to looking at the words individually.</p>
<p>&#8212;Update from 2012-04-10:</p>
<p>Stefan Keller ( <a title="@sfkeller" href="http://twitter.com/sfkeller">http://twitter.com/sfkeller</a>  ) hinted me to a<a href="http://t.co/CYVayPFU"> blog entry about how to use n-grams in a PostgreSQL based setting</a> to optimize search functionality.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/dakoller.wordpress.com/234/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/dakoller.wordpress.com/234/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.dakoller.net&#038;blog=33883038&#038;post=234&#038;subd=dakoller&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://blog.dakoller.net/2012/04/12/how-to-work-with-google-n-gram-data-sets-in-r-using-mysql/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/ee0740804be17ca85d68e2cafd4fb989?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">dakoller</media:title>
		</media:content>
	</item>
		<item>
		<title>knitR: Report-Generierung mit R</title>
		<link>http://blog.dakoller.net/2012/04/12/knitr-report-generierung-mit-r/</link>
		<comments>http://blog.dakoller.net/2012/04/12/knitr-report-generierung-mit-r/#comments</comments>
		<pubDate>Thu, 12 Apr 2012 05:54:45 +0000</pubDate>
		<dc:creator>dakoller</dc:creator>
				<category><![CDATA[data science]]></category>
		<category><![CDATA[datascience]]></category>
		<category><![CDATA[dsc]]></category>
		<category><![CDATA[R]]></category>
		<category><![CDATA[tooling]]></category>

		<guid isPermaLink="false">http://blog.dakoller.net/?p=225</guid>
		<description><![CDATA[<p><a href="http://yihui.name/knitr/" title="knitR: Report-Generierung mit R">knitR: Report-Generierung mit R</a></p><p>In Projekten zur Datenauswertung ist es häufig relevant, die Ergebnisse zeitnah ansprechend dokumentieren zu können. Ein gute Weg dabei, den Fortgang der Experimente und deren Ergebnisse in einem dynamisch generierten Dokument zu verfolgen: dabei kann knitR helfen.</p><p>knitR bügelt einige Schwächen der Sweave-Lösung (<a href="http://www.statistik.lmu.de/~leisch/Sweave/">http://www.statistik.lmu.de/~leisch/Sweave/</a> ): insbesondere können gut aussehende Grafiken besser eingebaut werden.  </p><p>Weitere Infos zum Tool finden sich unter <a href="http://www.inside-r.org/howto/knitr-elegant-flexible-and-fast-dynamic-report-generation-r">http://www.inside-r.org/howto/knitr-elegant-flexible-and-fast-dynamic-report-generation-r</a> , eine Beispielausgabe ist unter <a href="http://cloud.github.com/downloads/yihui/knitr/Stat615-Report1-Yihui-Xie.pdf">http://cloud.github.com/downloads/yihui/knitr/Stat615-Report1-Yihui-Xie.pdf</a> zu sehen.</p><img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.dakoller.net&#038;blog=33883038&#038;post=225&#038;subd=dakoller&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://yihui.name/knitr/" title="knitR: Report-Generierung mit R">knitR: Report-Generierung mit R</a></p>
<p>In Projekten zur Datenauswertung ist es häufig relevant, die Ergebnisse zeitnah ansprechend dokumentieren zu können. Ein gute Weg dabei, den Fortgang der Experimente und deren Ergebnisse in einem dynamisch generierten Dokument zu verfolgen: dabei kann knitR helfen.</p>
<p>knitR bügelt einige Schwächen der Sweave-Lösung (<a href="http://www.statistik.lmu.de/~leisch/Sweave/">http://www.statistik.lmu.de/~leisch/Sweave/</a> ): insbesondere können gut aussehende Grafiken besser eingebaut werden.  </p>
<p>Weitere Infos zum Tool finden sich unter <a href="http://www.inside-r.org/howto/knitr-elegant-flexible-and-fast-dynamic-report-generation-r">http://www.inside-r.org/howto/knitr-elegant-flexible-and-fast-dynamic-report-generation-r</a> , eine Beispielausgabe ist unter <a href="http://cloud.github.com/downloads/yihui/knitr/Stat615-Report1-Yihui-Xie.pdf">http://cloud.github.com/downloads/yihui/knitr/Stat615-Report1-Yihui-Xie.pdf</a> zu sehen.</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/dakoller.wordpress.com/225/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/dakoller.wordpress.com/225/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.dakoller.net&#038;blog=33883038&#038;post=225&#038;subd=dakoller&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://blog.dakoller.net/2012/04/12/knitr-report-generierung-mit-r/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/ee0740804be17ca85d68e2cafd4fb989?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">dakoller</media:title>
		</media:content>
	</item>
	</channel>
</rss>
