Afreen: FluidOps Provides Better Data from Multiple Sources with Semantic ...

Saturday, September 8, 2012


The market that NoSQL addresses is wide and crowded. It includes not only databases but also utilities that accelerate data collection, analytics, and visualization. The whole idea of Big Data is to derive useful intelligence and information from the vast amounts of data that were previously ignored and discarded. So in a way, it is data mining and business intelligence. But Big Data differs in the magnitude of its volume, velocity, and variety. In the enterprise market, most of the data in question are in known (structured) formats, and their variety is limited. It is also rare for vast amounts of data to arrive in real time. But this is changing now because of social networking services (SNS) and the spread of mobile computing.

Fluid Operations (FluidOps) aggregates data from different sources and converts them with some intelligence for better analysis. I sat down with Peter Haase, senior architect, and chatted about their Information Workbench, a comprehensive tool for collecting and analyzing data and visualizing useful information.

Peter Haase

Fluid Operations is located in Walldorf, Germany. SAP's headquarters is there as well. They currently have no US office, but their website provides information in both German and English. Peter and other people from the company are fluent in English.

As in other areas, utility companies in the power business collect and aggregate various kinds of data in addition to meter-read data. They may monitor equipment on the distribution grid, such as transformers, switches, relays, and capacitor banks. The data from the equipment and the meter-read data may be generated at dramatically different speeds. In addition to dynamic and real-time data, some static data such as asset information (equipment location, brand, model, specification, and service records) may be required to provide preventive maintenance and to report malfunctions and failures. The FluidOps solution is to collect and aggregate data from multiple sources and then to translate each datum semantically to a common form so that it has more meaningful information associated with it. Since all the translated data are in the same form with more meaningful relationships among them, analytics becomes more effective and can lead to more appropriate action.
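To make the idea of a common form concrete, here is a minimal sketch in Python. It is not FluidOps code: the namespace, the record fields, and the function names are all assumptions invented for illustration. It takes a static asset record and a line from a sensor feed, turns both into (subject, predicate, object) statements, and then answers a question that needs both sources.

```python
from typing import Dict, List, Tuple

Triple = Tuple[str, str, str]  # (subject, predicate, object)

# Hypothetical namespace and property names, invented for this sketch.
BASE = "http://example.org/grid/"
MODEL = BASE + "model"
LOCATION = BASE + "location"
OF_ASSET = BASE + "ofAsset"
OBSERVED_AT = BASE + "observedAt"
TEMPERATURE_C = BASE + "temperatureC"


def asset_to_triples(record: Dict[str, str]) -> List[Triple]:
    """Translate a static asset record (e.g. from an asset database)."""
    subject = BASE + "asset/" + record["asset_id"]
    return [
        (subject, MODEL, record["model"]),
        (subject, LOCATION, record["location"]),
    ]


def reading_to_triples(row: str) -> List[Triple]:
    """Translate one CSV line 'asset_id,timestamp,temperature' from a sensor feed."""
    asset_id, timestamp, temperature = row.split(",")
    subject = BASE + "reading/" + asset_id + "/" + timestamp
    return [
        (subject, OF_ASSET, BASE + "asset/" + asset_id),
        (subject, OBSERVED_AT, timestamp),
        (subject, TEMPERATURE_C, temperature),
    ]


def temperatures_for_model(store: List[Triple], model: str) -> List[float]:
    """Join the two sources: temperature readings for every asset of a given model."""
    assets = {s for s, p, o in store if p == MODEL and o == model}
    readings = {s for s, p, o in store if p == OF_ASSET and o in assets}
    return [float(o) for s, p, o in store if p == TEMPERATURE_C and s in readings]


store = asset_to_triples(
    {"asset_id": "T-17", "model": "X200", "location": "Substation 4"}
) + reading_to_triples("T-17,2012-09-08T10:00:00,71.5")

print(temperatures_for_model(store, "X200"))
```

Once both records share one shape, the query no longer cares which system a fact came from, which is the point of translating everything to a common form.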

"Semantically" means that they convert collected data into their normal form, which is represented using the Resource Description Framework (RDF). In this framework, each name is represented by a Uniform Resource Identifier (URI). The following example shows amyloid precursor protein (APP; its URI is http://bio2rdf.org/uniprot:P05067) and Alzheimer's disease (its URI is http://bio2rdf.org/omim:104300). APP is said to be closely related to Alzheimer's. The RDF form of these two entities is shown in the following slide. All the data collected are converted into this format. The query language for RDF is SPARQL, which stands for SPARQL Protocol and RDF Query Language.
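For readers who have not seen RDF, the sketch below shows the general shape of such statements in plain Python. It is only an illustration under stated assumptions: the two Bio2RDF URIs come from the text (written in standard http:// form), the rdfs:label property is the standard RDF Schema one, but the "associatedWith" predicate is hypothetical and is not taken from the slide or from Bio2RDF. The match function imitates a single SPARQL triple pattern, with None playing the role of a variable; it is not a SPARQL engine.

```python
from typing import List, Optional, Tuple

Triple = Tuple[str, str, str]  # (subject, predicate, object)

APP = "http://bio2rdf.org/uniprot:P05067"
ALZHEIMERS = "http://bio2rdf.org/omim:104300"
LABEL = "http://www.w3.org/2000/01/rdf-schema#label"
# Hypothetical predicate, invented for this sketch.
ASSOCIATED_WITH = "http://example.org/vocab/associatedWith"

triples: List[Triple] = [
    (APP, LABEL, "Amyloid precursor protein"),
    (ALZHEIMERS, LABEL, "Alzheimer's disease"),
    (APP, ASSOCIATED_WITH, ALZHEIMERS),
]


def match(
    store: List[Triple],
    s: Optional[str] = None,
    p: Optional[str] = None,
    o: Optional[str] = None,
) -> List[Triple]:
    """Return triples matching the pattern; None is a wildcard, like a SPARQL variable."""
    return [
        t
        for t in store
        if (s is None or t[0] == s)
        and (p is None or t[1] == p)
        and (o is None or t[2] == o)
    ]


def to_ntriples(triple: Triple) -> str:
    """Serialize one triple as an N-Triples line (objects starting with http:// are URIs)."""
    s, p, o = triple
    if o.startswith("http://"):
        obj = "<" + o + ">"
    else:
        obj = '"' + o.replace("\\", "\\\\").replace('"', '\\"') + '"'
    return "<%s> <%s> %s ." % (s, p, obj)


# Roughly: SELECT ?x WHERE { <APP> <associatedWith> ?x }
for triple in match(triples, s=APP, p=ASSOCIATED_WITH):
    print(to_ntriples(triple))
```

A real deployment would use an RDF store and SPARQL rather than a Python list, but the data model is the same: every fact is one subject, one predicate, and one object.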

FluidOps Information Workbench consists of data integration and storage, data management, and presentation/interaction/UI customization layers. From a 30,000-foot view, it collects and associates data using semantic models from diverse industry segments. For example, the Linking Open Data Community project is an attempt to make data from different industry segments freely available, and for that purpose the data are represented in RDF. The segments include media, geographic, publications, user-generated content, government, and life sciences. Their relationships are shown in the following diagram, which is maintained by Richard Cyganiak and Anja Jentzsch.

Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch: available here. Data published in Linked Data format based on the Linking Open Data Community project.

Click each circle on the figure here (not the figure above) to drill down through each dataset.

The following figure illustrates how Information Workbench collects and associates data with other data to increase their value semantically.

The disparate sources include tweets, Facebook, YouTube, data.gov, office documents, and various video files.

The architecture of Information Workbench is shown below. It consists of a data integration and storage layer (green), data management (brown), and presentation, interaction, and UI customization (blue).

Fluid Operations looked at which RDF datasets are available to exploit for effective analytics. Their current application areas include media, health care, and life sciences. I asked Peter about applying it to the power industry. He said they were not looking into that yet but might consider it if they get a research grant. I do not know whether a dataset is already available for the power industry, but I think the industry could benefit from something like this.

I talked about each utility's operation, but if we look at each region, such as ISO/RTO, the regional power balance information and data are very useful. I would like to follow this as it grows.

About Zen Kishimoto

Dr. Zen Kishimoto is in charge of Green IT at Alta Terra. His broad technology background and diverse functional roles at individual-contributor and executive levels in large corporations and start-ups are a strong basis for conducting research in the greening of IT. Both strategic and tactical insights based on these experiences are necessary to make IT and its related technologies greener, since both a holistic and a component-level view are necessary. This is especially so in his first area of concentration, data centers, in which a large number of software, computer hardware, and networking components as well as facility elements are interrelated and configured in a complex manner. For over 25 years, Zen was involved in various technology areas as a user and a vendor, including software development methodologies/process/tools, Open Source Software (OSS), Internet/network security, embedded software/systems, networking, Web, and VoIP, to name a few. Based on exposure to those multiple technology areas, he can take the perspective of a user or a vendor of each technology as necessary. After working for Fortune 100 companies, Zen has been a successful entrepreneur and software business consultant specializing in product management, turning technologies into viable businesses and covering each phase of product management. This includes market research, technology assessment, project management, technical marketing, promotion, product launch, business development, and sales. As a consultant, he has also produced numerous research papers for his clients in the areas of software and telecommunications. In addition, Zen, originally from Japan, has a web of business contacts and relationships in Japan and is keen on the green IT/technology market outside of the US, bridging language, culture, and business practice for his clients.
As the greening of IT and its related technologies requires a global view, he can give his clients advice and comments that are not confined to the US domestic view but reflect a global one. Finally, before joining Alta Terra, he played CTO, COO, and other executive roles in Silicon Valley startups, including Cardsoft. Earlier he served as functional general manager and Senior Director at NEC Technologies, where he started the Internet business unit. He has held technical positions at NEC, Hewlett Packard, and GTE. He is also the principal of IP Devices, a software business and market research consultancy specializing in IT infrastructure.

Source: http://tek-tips.nethawk.net/fluidops-provides-better-data-from-multiple-sources-with-semantic-modeling/
