All posts by Inderjeet

Inderjeet is the founder and main author of Tech2view.com and writes about Technology, SEO, social media and tech related tips and tricks. Apart from blogging he works full time as a Database developer. He loves to explore more about Internet based businesses and wants to own/make one. Twitter handle: http://twitter.com/singhinder

What is Data Mining and Techniques involved

data-mining

 

 

 

 

 

 

 

Data mining is the process of analysing data collected from different sources into useful information. It can be used to increase business profits and keep a track on the company’s performance. Data mining software helps users to analyse data from different angles. The data is categorized, summarized and correlated with large relational databases. The main aim is to get a view of the company’s performance. With the help of data mining, business owners can get a broad view on various trends. It can show the areas where there is more profit and where there is loss. It can reveal the ways how to control the unnecessary expenditure and increase the total income. Continue reading

Top 3 NoSQL Databases Features

A NoSQL database provides an effective mechanism for storage and data retrieval. These databases excel both in terms of speed and volume. Moreover, it is increasingly been considered a viable alternative to relational databases. While working with a huge quantity of data, NoSQL is very useful. Let us discuss some of the top NoSQL databases and their performance in data management systems:

NoSql Databases

Apache Cassandra

Apache Cassandra has been primarily designed to handle huge amounts of data spread out across commodity servers. It was developed by Facebook in 2008 as an open source distributed database management system. It helped Facebook to power up their inbox search feature. It offers the mixture of a column-oriented database and the columns are grouped into families. It provides a highly available service with its massive amount of consistency and replication. Apache Cassandra is preferred by companies as a back-end database. The most amazing features include BigTable modeling and the Gossip protocol. Some of the strong points favouring Apache Cassandra are highly available with no single point of failure, NoSQL column family implementation, flexible scheme, SQL like query language, support search through secondary indexes and support for replication. Apache Cassandra is a perfect solution and it provides scalability & performance needed in most applications. Continue reading

What is Big Data? Importance and Applications

Big Data refers to the large datasets that may become difficult to work with. In some cases, the size has been known to rise to 50 TB or multiple petabytes. The technology has enabled us to handle big data in an intelligent way. We can analyse and interpret the data for its use in different industries. The only risk is that we don’t focus on the quality of data analysed. It can have many uses such as basis of product sentiment analysis, thereby leading to product improvements. It has been estimated that 90% of big data in the world has been collected within the last 2-3 years.  

Big Data

Modern social media platforms

Big data forms the basis of most popular social media platform i.e. Facebook. It collects more than 500 TB of data every day. This data comprises of likes, status updates, photo uploads and all related interactions. Similar is the case with blogging site Twitter which collects more than 10 TB of data per day. Continue reading

Making Sense of Data – Free Course from Google

This is a good course offered by Google for data professionals and data enthusiasts. The course is self paced and divided into lessons and activities. Activities are small quiz which will help the learner to practice theory. At the end of the course there will be final project, upon successful completion of which certificate is also provided.

The course revolves around using Google Fusion Tables, which is free online software. The concepts taught is this free course are useful in general data process of data preparation, analysis and withdrawing conclusion from analysis. The course will run from March 18th to April 4th, 2014.

Making sense of Data

Link: https://datasense.withgoogle.com/

Big Data

Learn BigData – Top 5 Free Courseware Sources on the Web

Human kind has accumulated data for the past century. And now the same data has revealed new truths every day. With recent advancements in data mining new patterns are being discovered every day. BigData is THE place to be for any IT individual as it is the most popular area of research these days. Hadoop, R, MapReduce, MongoDB, CouchDB are some of the tools used for big data analysis. A lot of new technology has been developed using BigData concepts, like Facebook Friend Finder. If you are any business analyst or an IT professional dealing with databases then you need to make yourself aware about this technology trend immediately. And you can do all of this for FREE! I hope I got your attention now. So here are top 5 sources on the web to teach yourself about BigData:

big data

Big Data

  1. CourseEra: courseera.com is the best web source for learning about Bigdata and other subject in computer science. It has courses from best universities of the world. Moreover, it also provides you with valid certification from those universities for a nominal fee. However, if you just want to learn you can take the courses for free. Any course includes video tutorials, assignments, discussion forums, projects, quizzes, exams etc. The quality is obviously excellent.  Continue reading
chromebook2

Google Chromebook 2: Now in an ‘Almost-Laptop’ Version

chromebook2

After Android, Google broached another domain where it had not been before with its Chrome OS; now Microsoft is not the only company making low end, cheap notebooks which can be used for video chatting and light internet surfing etc. Previously, Chromebook was targeted as a solution for low end users who intended to use apps available on-line. The screen was low resolution and the processor had limited power. The laptop started instantly and was quite user friendly. However, nobody counted it as a ‘real’ laptop. It was merely half a step above a tablet. But now Google is soon to release Chromebook 2 with its partner Samsung and everyone thinks it will be almost a real laptop.

Continue reading

Samsung Galaxy S5: Survival of the Fittest?

With the release of Android mobile OS, Google opened a floodgate of new mobile technology. With the release of newer and better devices every day from various companies it has been ensured that mobile technology becomes as pervasive as possible. I don’t know about anyone else but my phone is surely the center of my world. The market of smart phones has become so competitive that a company can’t survive just by creating ‘good’ products. The products must have something extra that makes people want it.

Samsung’s latest release has taken the cake for me in that regard. With the release of Samsung Galaxy S5 along with Gear 2 and Gear Fit, the company has created a buzz in the smart phone market. And why should it not when the features are awesome! As I write this article, I have no idea which feature I should deem to be the best. From looks to functionality the phone is a top notch premium piece of technology.

Here’s a look at the features:

With a unique perforated back cover and four different vivid colors, the phone looks amazing and has a comfortable grip too. It has a 5.1 inch full HD super amoled screen with adaptive display which makes video viewing an experience of a lifetime. For photography enthusiasts, the phone offers at 16 megapixel rear camera with autofocus feature. Samsung claims the speed of autofocus to be 0.3 seconds. It has both contrast autofocus and phase detection autofocus which is a world first in smart phones. Another feature is the selective focus which blurs the background and lets you focus on one specific object. Continue reading

Git-Logo-2Color

Useful Git workflows and Tips

These are some workflows which I find useful while working at my job. Hopefully they can save you some time while you wrap your head around around Git basics.

Git branching: The commands below checks out the new feature branch from the master. Then so some work and do commits. Then checking out master branch  and merge to the master branch.

  • git checkout feature_xyz
  • git commit -a -m “I am committing now”
  • git checkout master
  • git merge feature_xyz

Pushing a feature branch to the Git repository:

  • git checkout feature_x
  • git push origin feature_x

Amending the last commit:
git commit –amend -m “New commit message”

Generating new keys and adding them – Client side

  • ssh-keygen -t rsa -C “your_email@example.com”
  • ssh-add your_name

If you get the error “Could not open a connection to your authentication agent” in Git Bash. Run the following commands to troubleshoot the problem:

  • ssh-agent
  • eval “$(ssh-agent)”
  • ssh-add your_name

If you want to permanently add keys, so that Git bash recognizes them automatically when pushing changes to the repo. Rename private and public keys to id_rsa and id_rsa.pub

On the machine where you are pushing changes (Server):
Add your_name.pub (id_rsa.pub) public key under home/your_home_directory/.ssh/
Copy id_rsa.pub file to authorized_keys file

Reverting back to particular commit
git revert ‘commit-id’

Resetting the repository to a particular commit:
git reset –hard ‘commit id’

Removing/adding (synchronizing) branches from remote: (Very useful)
git fetch -p

Cleaning up untracked files:

git clean -f

Using git mergetool (I usually use tortoise merge http://tortoisesvn.tigris.org/TortoiseMerge.html)
git mergetool [--tool=]

Fetches any new changes from the original repository:

git merge upstream/master

Creating an empty branch:

git checkout –orphan <branchname>

Deleting the remote branch:

  • git branch -d (local branch)
  • git push origin :<branchName> (remote branch)

Before pushing the feature branch to the git repository, its a good practice to get all your changes in one commit. For this, I do the following:
git rebase –i HEAD~(number of commits) // replace (number of commits with the number of commits I made to my branch)