Category

Big data

Category

The term data mining is often used when it comes to the storage and management of information in the big data area. Many companies use data mining as a tool by enabling the systematic application of computer-based procedures to find patterns, trends and relationships within large databases. It builds on various findings from the fields of computer science, statistics and mathematics by performing analyzes of databases. These analyzes pursue the goal of finding connections, patterns, trends and relationships between information within large databases and making them usable. Data mining works in a purely automated manner, which results in both cost and time savings. Companies can then use the results provided to make decisions about strategies or problem solving more easily. Functions Data mining is mostly used for the achievement of several goals by companies. In order to achieve these goals, it has to do a variety of tasks. This includes:…

In the context of big data, one always needs powerful platforms that can efficiently store a large amount of data. Such a platform is also called a data warehouse. This analyzes the information it contains according to certain patterns. Data warehousing process The data warehousing process, which is often used to describe how it works, comprises four main main steps for analyzing data by managing the data in the data warehouse and evaluating it for results. The 4-stage analysis process of a data warehouse Acquisition of data from the source systemLoading the dataBackup of the dataAnalysis and evaluation of the stored data This is how a data warehouse is structured A data warehouse, like a real building, is basically a construct made up of several elements. The foundation is an operational database that contains a large amount of information. The so-called staging area, which has the task of pre-sorting the…

Data processing in the area of big data often poses great difficulties for many companies. To counteract this problem, many organizations use tools such as software-based frameworks. These also include Hadoop, which is connected to Java. What is Hadoop The Java-based software framework Hadoop can most easily be imagined as a kind of shell that can be tailored to the most varied of architectures and operated by a wide variety of workers, in this case the hardware. The framework was invented by Doug Cutting, who developed Hadoop into one of the best projects in the field of the Apache Software Foundation by 2008. Cutting developed the software framework for better management of distributed and scalable systems. It is based on the MapReduce algorithm from Google, which uses Hadoop to combine large amounts of data in detailed computing processes on distributed but networked computers. Hadoop is not only so popular, but…

In the age of digitization and big data, a lot revolves around one thing: data. Terms such as data mart and data lineage regularly catch the eye. It is not always clear exactly what the technical terms are, which is why this article is intended to provide a brief overview. What is a data mart? Data marts are a kind of collection point for user-defined data. In doing so, data is extracted from large data stocks and made accessible in isolation for certain user groups. They thus form a sub-segment of a data warehouse and can help to make certain data accessible to the user more quickly and with less effort. This not only saves time but also costs. Data Mart vs Data Warehouse Both data marts and data warehouses are used to store and manage data records until they are used. Data warehouses are specialized in organizing the entire…

Big data analyzes usually require a large amount of data in order to capture and collect all information in its raw state. This data storage resembles a real sea in size, which is why the technical term “data lake” has been established for it. You can find out exactly what this is all about in this article. definition As a large data store, the data lake manages the entire mass of data in its original form, i.e. in its raw format. He makes use of the collection of information from a wide variety of sources. It makes no difference to the data lake whether the data has a structure or not. This large data store also does not require any prior validation or reformatting of the data. However, a data lake cannot manage number or text-based data. In addition, it can also save information from the media area, such as…

Big data is an important topic. That already shows a study by Bitkom . In 2018, the association surveyed over 600 companies on trending topics and found the following results: 57 percent are planning investments in big data or are already being implementedThe five top topics are big data (57%), Industry 4.0 (39%), 3D printing (38%), robotics (36%) and VR (25%)But: New concepts and possibilities such as artificial intelligence and blockchain have only rarely been used so far Reading tip: What is big data Implementation of big data only hesitantly According to the study, the potential of big data is only being used hesitantly. According to the study, the reasons for this are the requirements for data protection (63%) and the technical implementation (54%) as well as a lack of specialists (42%). Reading tip: What is a data scientist I am currently working on the technical implementation. In order to…

Chatbots are on everyone’s lips and could significantly change the working world in companies. The use of chatbots, for example, offers a complete revolution in customer service for companies. On the other hand, this could also lead to massive job losses and also make numerous job profiles such as the call center agent no longer necessary. In the following article I would like to shed light on the current and following possibilities of chatbots. Klarmobil chatbot The first thing I did was look at the Klarmobil chatbot. According to the mobile operator, 1000 customers use the chatbot every week and ask almost 6 questions. In addition to the classic customer advisor, the bot answers a set of standard questions similar to the FAQ. WetterOnline chatbot I found another great use case at WetterOnline. There I can give the bot special instructions. These are for example show me the weather on…

In my research, in addition to a group discussion, I also collected the data. I carried out a quantitative survey for this. This is used for the quick collection of data that I have in the Round tables could evaluate together with the participants. In this article, I’ll give you some tips on how to do this. The following sections are taken from and quoted from my doctorate. Advantages and pilot test The advantage is that a high mass can be achieved quickly. The links to the questionnaire can be distributed specifically to already known participants and recommendations in their networks. In contrast to direct methods such as telephone interviews, the layout and design of the questionnaire is very important, as the researcher cannot provide explanations (Porst 2014). For example, a question can be misinterpreted. For this reason, my questionnaires were always tested with 5 pilot people, who then sent…

The large amount of data continues to grow. In fact, it is now being said that data is the new oil. At the same time, there is also a new job description. The name of the data scientist appears more and more. So says the portal SAS : “Anyone who knows how strategically important knowledge can be drawn from large amounts of data and can also convey this has a key position in the company as a consultant for top management.” But if you look at the job advertisements you will find a lot about it and you ask yourself: What does a data scientist do? This article is intended to provide information. Reading tip: What is big data What should a data scientist be able to do? If you look at the job advertisements, a data scientist should usually be able to do the following: Analytical talent Expertise communication…

“Big data creates mixed feelings for many people. The economic opportunities are obvious. But the possibilities of abuse are also evident ( Computer week ) “. Big data is certainly more than just hype and brings numerous new opportunities with it. However, there are also many risks, which are discussed in this article.Reading tip: What is big data Big Data Risks: Monitoring An example can be found in the Computer week when looking for a perpetrator on the autobahn: “The investigators had installed cameras on seven relevant sections of the autobahn. These read in the license plates of all passing automobiles, including those of the vehicles being shot at. In April 2013, the police received reports of gunfire on trucks again within five days, a total of six. “Of course, this massive storage of data allows a high level of surveillance. The verdict of Computerwoche: “The evaluation of massive amounts…

By continuing to use the site, you agree to the use of cookies. more

The cookie settings on this website are set to "Allow Cookies" to provide the best browsing experience. If you use this website without changing the cookie settings or click "Accept", you agree to this.

close