Home > Software design >  Need help in deciding between Apache flink and apache spark
Need help in deciding between Apache flink and apache spark

Time:04-21

We are trying to decide to use either Apache Spark or Apache Flink for our projects. I have the below questions. Need help in finding the answers.

  1. I see that most of features of Spark are covered in Flink, except for The Fair scheduling of Spark. I tried googling and going through flink documentation but had no luck.
  2. Also if you see Github Apache spark has almost double the popularity(no of stars, Forks) when compared to flink. So I am curious to know why Flink doesn't have as much popularity as Spark.
  3. I also see the number of connectors written to flink are too less/ less maintained than the number of connectors to Spark. (eg: mongodb). Does it mean flink is yet to be matured/ get market traction?

The answers to the above will help us decide on the appropriate technology.

Ravi

CodePudding user response:

I think you really know flink more detail,so,you can get the answer

CodePudding user response:

As always, choosing for a technology depends on the best fit for the business problem that you're trying to solve. I can't answer that based on the original question. Are you interested in batch or streaming data? Is latency important? Do you want to perform stateful processing or stateless? How about exactly once state consistency? What are the sources and what are the sinks that you need to connect to? Do you want to write your logic in Java, Scala, SQL, Python or something else? These are just some of the questions you need to consider.

  • Related