Big Data Proving Return on Investment

A recent article from IT World describes the advantages of utilizing and leveraging big data and pushes small businesses to take advantage of the benefits big data has to offer. A “recent survey by Forrester found that 7% of IT executives and 9% of business leaders feel they have gained a true return on investment from big data,” proving that some companies have jumped on the band wagon, and they’re not all big powerhouses.

big-dataBefore, there wasn’t a platform to hold massive amounts of data but now as the article Big Data is big but a young market notes there is an option and it’s a cost effective one as well, “perhaps the biggest-name platform for handling this data is Hadoop, an open source distributed computing architecture that excels in handling massive data sets … the type of data analytics that used to only be available to the biggest companies in the world can now be done by small companies. The rise of cloud-delivered services … plus the continued drop in the price to store information, combined with a management platform like Hadoop to manage it, means more businesses are holding on to more data …it gives an opportunity for the little guys to catch up to the big guys.”

Companies can and should be taking advantage of  ”these two trends — a massive increase in the amount of data available to businesses, and a platform for handling it.” If done correctly companies and organizations “can save the data in case they find a use case for it down the line. Having all that data, plus a platform to analyze it, means companies can run queries based on a whole set of data, instead of before just taking a subset of data to analyze.”

Our core product, Savanna, a multi-source analysis solution, leverages the data storage and processing capabilities Hadoop provides. With Savanna’s suite of analytical tools, we can help you make sense of the stored information.

Big Data – A Resource in Analysis and Decision-making

Yesterday morning the Technology Association of Oregon hosted a Technology Forum titled Big Data Now – but How to discuss the opportunities and challenges that come with this exponential growth of data, and mainly how do we use this data effectively? Three speakers from the top local analytics companies helped lead the discussion, one being Thetus’ Data, Analysis and Training team lead, Dan Vizzini. Below is a summary of Dan’s presentation on the perspective that Big Data is merely a resource in the service of good analysis and decision-making.

Focusing on big data is akin to the perspective of a sculler rowing upstream

Focusing on big data is akin to the perspective of a sculler rowing upstream

The arc of human development is fundamentally a reflection of the human capacity to gather, store, analyze and act upon information. Transformational leaps in society, economy, culture and science coincide with revolutionary changes in communications and information technologies. While computers and related technologies have radically expanded the human capacity for processing information (and has begun to replace humans entirely in some decision-making), the most important decisions in the life of human societies and enterprises remain in the hands of human beings. The pace of technologies, processes and regulations that drive the creation of big data is outdistancing the development of technologies, processes and practices required to extract understanding and insights in support of effective decision-making. The past is not necessarily prologue in a world beset with wickedly complex problems. Focusing on big data is akin to the perspective of a sculler rowing upstream.

Effective analysis looks beyond the bounds of known and knowable data; extending the reach of discovery, understanding and anticipation to unmapped domains of economic, social and ecological risk and opportunity.

Thetus software is built for this unmapped landscape of risk, opportunity and resilience. Thetus’ core product, Savanna, is a suite of analysis tools that encourage the formulation of knowledge models and tested hypotheses; enable the fusing of structured and unstructured data; and facilitate visualizations and fully-sourced productions for decision-makers.

Thetus provides solutions to organizations with complex and wicked problems, problems devoid of obvious pathways to solutions and subject to dynamic forces that require adaptable and resilient management strategies. Such problems require the kind of modeling, analysis and synthesis that only a well-trained and disciplined human being can perform. Our job is to facilitate that human intervention. Thetus begins with the problem, and considers big data as one type of resource in the hands of a competent team of analysts.

In the final analysis, Thetus answers the question of how to make the most of big data by asking… to what end? Our software facilitates the analysis to help our clients navigate their way to a desired reality by means of model-enabled and adaptive policies and practices. For Thetus, big data is an important resource to help human beings solve complex, wicked problems.

Below is information on the two other presenters.

Nitin Mayande, Co-founder and Chief Scientist of Tellagence – Tellagence is used to organize and analyze public and private data to empower businesses to improve the decision they make in marketing, operations, and investments.

Eimar Boesjes, Chief Operations Officer and Chief Technology Officer at Moonshadow Mobile – Moonshadow Mobile works with large databases, from tens of thousands to hundreds of millions of records to visualize “big data” over maps.

Freebase the Knowledge Base

freebaseWe love talking about tools that can make us more efficient and productive, especially when they’re free! A few of our developers and modelers have been discussing their use of Freebase, a community-driven knowledge base of metadata. Freebase is on the web and offers users a “global resource that allows people (and machines) to access common information more effectively.” Freebase provides a hefty compilation of structured data ranging from various sources, data available for free under a Creative Commons Attribution License, an open API, RDF endpoint and a database dump. The original idea generated by Metaweb, later acquired by Google, has been running publicly for the past six years and only requires registration for data contributors.

“Freebase runs on a database infrastructure created in-house by Metaweb that utilizes a graph model. This means that instead of using tables and keys to define data structures, Freebase defines its data structure as a set of nodes and a set of links that establish relationships between the nodes. Because its data structure is non-hierarchical, Freebase can model much more complex relationships between individual elements than a conventional database, and is open for users to enter new objects and relationships into the underlying graph.”

We at Thetus have some Freebase fans that can attest to it being a great resource for prototyping, idea generation and basic model workings.

Have you used Freebase before and what benefits have you found from using it?

Mass Text Edit

Who doesn’t hate the mind-numbing task of making the same minor edit to a giant list of text? Welcome to a world free of that tedium, check out this Mass Edit tool for automating text editing! This editor does simultaneous text editing for excel lists and Java scripts, see how it works or test it out here.

Screen Shot 2013-01-31 at 2.57.34 PM

Preparing for the National Day of Civic Hacking

safe_imageThat’s right, the White House blog also recognizes the power of do-gooder hacking and wrote this post, Roll Up Your Sleeves, Get Involved, and Get Civic Hacking about National Day of Civic Hacking.

They also noted in their post that “the event is taking place in conjunction with Random Hacks of Kindness and Code for America’s Brigade meetings and is being modeled after the Innovation Endeavors’ Super Happy Block Party.  A number of Federal agencies, including NASA, the Census Bureau, and the Department of Labor, are participating by offering specific challenges for hackers to work during the event.”

Cyber Vigilantism

An article from the LA Times, A new brand of cyber security: hacking the hackers, discusses a new company called CrowdStrike, that helps enterprises and government keep hackers at bay. Led by the former head of the FBI’s cyber crimes division, CrowdStrike ”stakes out networks to catch infiltrators, keeps dossiers on hackers and tricks them into stealing bogus data.”

This raises the debate of how far companies should go in defending themselves from cyber attacks, what do you think?

A checklist for evaluating model-enabled analysis solutions

If you’re in the market for a model-enabled analytical tool, read this first. In the white paper called Model-Enabled Analysis: Factors for Evaluation,  the authors from CTOlabs list nine mission-focused factors to take into account when evaluating model-enabled analysis solutions, see below.

• Mission functionality/capability
• Ease-of-use/interface
• Architectural approach
• Data architecture
• Modeling capability
• Licensing
• Partner ecosystem
• Deployment models
• Health of the firm

For a further explanation on each factor, download the white paper here.

Maven Made Easy

One of the most commonly used open source technologies in our office is Maven. Like any tool that is used by various people for various tasks, Maven can become frustrating at times. Luckily one of our employees was kind enough to share this little gem he found, Maven : The Complete Reference; here you’ll find a very thorough, all encompassing guide to Maven.

In a previous blog post titled Open Source Invite we shared our first Thetus-bred open source product built to simplify plug-ins for Maven. The Thetus Maven Toolbox drastically simplifies authoring certain types of plug-ins, check it out for yourself  https://bitbucket.org/Thetus/maven-toolbox

Universal Metric for Sustainability

Analyzing company’s sustainable initiatives and/or environmental impacts is often confusing for non-experts. Energy Points, a new startup, is helping ease this stress by translating various units of measure (like gallons, CO2 etc) to one simple, single score by using a points system that takes into account geospatial and temporal elements. Their unit of choice is the amount of energy it takes to generate one gallon of gasoline (energy per gallon), the benefit is “one to one comparison across resources.”  When it comes to analyzing sustainability, “the problem isn’t the availability of data, it’s the interpretation,” says Energy Points founder and CEO, Ory Zik. By looking at data with the same unit of measure, companies can now easily compare environmental impact factors by analyzing resource consumption through different methods like water usage or transportation type.

Cyberattacks and America

At a conference in Colorado, the head of the National Security Agency and the United States Cyber Command, General Alexander, publicly stated that there has been a “17-fold increase in computer attacks on American infrastructure between 2009 and 2011.”  These sort of attacks could effect our “electricity grids, water supplies, computer and cellphone networks and other infrastructure.” Read the recent New York Times article to see how America is equipped to handle these cyberattacks.

Follow

Get every new post delivered to your Inbox.