Skip to main content

Get Git

The requirements for the data science class that I'm taking include having access to R, Python, SQL and GitHub. As I'm already familiar with R, Python, and SQL, my goal tonight was to get started with GitHub. According to Wikipedia, "GitHub is a web-based hosting service for software development projects that use theGit revision control system. GitHub offers both paid plans for private repositories, and free accounts for open source projects. As of May 2011, GitHub was the most popular open source code repository site.[3]"

So, basically people come together on this website and share code... if you don't want to pay, you agree to have your code (in your "repository") available as open source. The future is Crowd Sourcing...

and speaking of the future, I think this chart from the GitHub website is an interesting reference for those looking to figure out what programming languages they should be working on if they want to remain competitive in the future:

Obviously these "stats" are quite dynamic (what people use when it comes to computing changes at near light speed); however, it's at least a starting point for people who want to begin diving into the world of computer programming. Also, the sample base of users is not exactly random (the people who are involved with GitHub are likely more technical than your average user), but if you want to enter the world of Comp Sci and contribute your great mind to the open source community, the above languages are likely worth your perusal. 

Personally, I have the most experience with Python (and R, though it's not listed here)... at least an intermediate level of knowledge. I've worked with Objective-C (I am a Mac user), and (a looooong time ago) I used to write a lot of JavaScript; also, I took a CS course in 2000 that was heavily based in C++ but I haven't touched that language much since about 2003. 

I'm interested to learn more about NoSQL and Tableau, which apparently we will be introduced to in this class...

woot woot!
-time for bed


Popular posts from this blog

Downloading a Video from Facebook

Hey... here's a neat trick for downloading a video from Facebook:

First, notice that your "Saved videos" can usually be found in the upper right menu (where birthdays usually show up) or under the "Saved" button in the left menu.

Once you click on the saved video, you'll find the file... click on that bad boy to play it in the browser.

Once it's playing, notice that there's no way to download it (that's what we're gonna "hack")

In the URL of the playing video, replace www with m (for mobile)

You should be brought to the funky looking mobile site (even though you're on a regular browser)

Play the video and then right click, you should see the following menu:

Save it as an MP4, crack a beer, and enjoy your Video whenever you want without having Facebook open.

Big Data as Big Brother

Yesterday and today, it seems like one of the biggest news items is the Big Data of Big Brother. 

The argument goes like this:

The U.S. government is aggregating the same kind of social media and online user data that private companies use to understand their customers' sentiments (or potential customers) for the express purpose of counter terrorism, reporting on potential defense threats, and generally trying to figure out who the "bad guys" in the world are...

Using data from companies such as Apple, Google, Facebook, Microsoft, Skype, Yahoo, YouTube, and others, the National Security Agency is able to obtain all types of data (1). Now, when one considers the total size of this data it's clear to see that this exits the world of statistical analysis and enters the world of Data Science and Big Data. 

To give you an idea of the size of the data, "The amount of data in question is enormous. For example, the U.S. wireless-communications trade association C.T.I.A. est…

Life Statistics

Today I used Excel to analyze total email volume and client specific email volume data (sent/received) weighted by an "effort" ranking to help highlight work flow at the office. In the interest of not having to check my email as much as I am currently, I hoped to shed light on prioritization of time spent with email vs projects. 

My hope is to only check my email twice a day... But, given the client service that my role entails, that's probably not gonna happen. One thing that came out of my analysis though is when getting the most email (day of week/time of day) is most probable. It turns out Tuesday mornings and Friday afternoons are the most likely times for stuff to hit the proverbial "fan."

In other life statistics news (and productivity measures), I'm tracking my time spent during the day in a manner similar to the "hyper tracking" concept I've learned from listening to The Better Guy Show podcasts (available on iTunes). In the next week I…