Although a lot of specialists argue in favor of Python, Java is also required for data analytics.
I-banks actually prefer java for building enterprise systems. Many Big Data systems are developed in Java or created to run on JVM. The stack may include the following tools:
- Spark is used to stream data and distribute batch.
- Kafka – to queue huge volumes of information.
- Spring Boot – to provide system’s options to the customers via REST API.
data vs information — I
feel for high volume, high reliability, low-level “data” handling, java (and C++) are more suitable. For high-level “information” analysis, python and R are more suitable. However, in reality m /feel/ might be wrong. Python might have big frameworks comparable to java’s.