Skip to main content

Get Git

The requirements for the data science class that I'm taking include having access to R, Python, SQL and GitHub. As I'm already familiar with R, Python, and SQL, my goal tonight was to get started with GitHub. According to Wikipedia, "GitHub is a web-based hosting service for software development projects that use theGit revision control system. GitHub offers both paid plans for private repositories, and free accounts for open source projects. As of May 2011, GitHub was the most popular open source code repository site.[3]"

So, basically people come together on this website and share code... if you don't want to pay, you agree to have your code (in your "repository") available as open source. The future is Crowd Sourcing...

and speaking of the future, I think this chart from the GitHub website is an interesting reference for those looking to figure out what programming languages they should be working on if they want to remain competitive in the future:

Obviously these "stats" are quite dynamic (what people use when it comes to computing changes at near light speed); however, it's at least a starting point for people who want to begin diving into the world of computer programming. Also, the sample base of users is not exactly random (the people who are involved with GitHub are likely more technical than your average user), but if you want to enter the world of Comp Sci and contribute your great mind to the open source community, the above languages are likely worth your perusal. 

Personally, I have the most experience with Python (and R, though it's not listed here)... at least an intermediate level of knowledge. I've worked with Objective-C (I am a Mac user), and (a looooong time ago) I used to write a lot of JavaScript; also, I took a CS course in 2000 that was heavily based in C++ but I haven't touched that language much since about 2003. 

I'm interested to learn more about NoSQL and Tableau, which apparently we will be introduced to in this class...

woot woot!
-time for bed


Popular posts from this blog

Downloading a Video from Facebook

Hey... here's a neat trick for downloading a video from Facebook:

First, notice that your "Saved videos" can usually be found in the upper right menu (where birthdays usually show up) or under the "Saved" button in the left menu.

Once you click on the saved video, you'll find the file... click on that bad boy to play it in the browser.

Once it's playing, notice that there's no way to download it (that's what we're gonna "hack")

In the URL of the playing video, replace www with m (for mobile)

You should be brought to the funky looking mobile site (even though you're on a regular browser)

Play the video and then right click, you should see the following menu:

Save it as an MP4, crack a beer, and enjoy your Video whenever you want without having Facebook open.

Data Science Course (Complaint # 2?)

Looks like I'm not the only one having problems with the first assignment for UW's Introduction to Data Science.... while Professor Howe has certainly whetted my appetite for wanting to learn more useful DS than I've learned before, the first assignment seems like a weed-out assignment. I remember from my economics days that this is actually a fairly common practice among econometricians (thanks Professor Levy). [Side note: I'm STILL working my way through Data Analysis and Regression: A Second Course in Statistics.]

I've submitted most of what was required in a Python file for A1P2; however, it's probably going to get a 0/10. I fully intend on sticking with this course after reading a lot of the complaints. Most of the people complaining seem to have the "must have now" mentality; fortunately my years of dealing with this kind of stuff is that it rarely comes easy... that said, my intent to blog along with this course is going to be less sporadic and …

Big Data as Big Brother

Yesterday and today, it seems like one of the biggest news items is the Big Data of Big Brother. 

The argument goes like this:

The U.S. government is aggregating the same kind of social media and online user data that private companies use to understand their customers' sentiments (or potential customers) for the express purpose of counter terrorism, reporting on potential defense threats, and generally trying to figure out who the "bad guys" in the world are...

Using data from companies such as Apple, Google, Facebook, Microsoft, Skype, Yahoo, YouTube, and others, the National Security Agency is able to obtain all types of data (1). Now, when one considers the total size of this data it's clear to see that this exits the world of statistical analysis and enters the world of Data Science and Big Data. 

To give you an idea of the size of the data, "The amount of data in question is enormous. For example, the U.S. wireless-communications trade association C.T.I.A. est…