Apache Ranger UserSync Configuration HELP!!
Reddit » Hadoop
by /u/Clean-Mix-6909
2h ago
I am trying to configure Apache Ranger usersync with Unix, and I am stuck at this point. After I execute this:
sudo JAVA_HOME=/usr/lib/jvm/java-8-openjdk-amd64/ ./setup.sh
this error pops up:
teka@t3:/usr/local/ranger-usersync$ sudo JAVA_HOME=/usr/lib/jvm/java-8-openjdk-arm64 ./setup.sh
[sudo] password for teka:
INFO: moving [/etc/ranger/usersync/conf/java_home.sh] to [/etc/ranger/usersync/conf/.java_home.sh.28032024144333]
.......
Direct Key not found:SYNC_GROUP_USER_MAP_SYNC_ENABLED
Direct Key not found:hadoop_conf
Direct Key not found:ranger_base_dir
Direct Key not found:USERSYNC_P ...
Hive Shell Issues
by /u/ReporterNo8873
1w ago
Help with setup in MAC
by /u/83here
1w ago
Hi guys, I have been trying to run Apache Hadoop (3.3.1) on my M1 Pro machine and I keep getting the error "Cannot set priority of namenode process XXXXX". I understand that macOS is not allowing the background process to be invoked. Is there any possible fix for this?
Namenode Big Heap
by /u/krusty_lab
1w ago
Hi guys, long story short: I am running a big Hadoop cluster with lots of files. Currently the NameNode has 20 GB of heap, which is almost full the whole time, with some long garbage-collection cycles freeing up little to no memory. Is anybody running NameNodes with 24 or 32 GB of heap? Is there any particular tuning needed? Regards
[Hiring] Big Data Engineer with Spark (located in Poland)
by /u/ComprehensiveSell578
2w ago
Scalac | Big Data Engineer (with Spark) | Poland | Gdańsk or remote | Full time | 20 000 to 24 000 PLN net/month on B2B (or equivalent in USD/EUR)
Who are we looking for? We are looking for a Big Data Engineer with Spark who will be working on an external project in the credit risk domain.
You should have expertise in the following technologies:
- At least 4 years of experience with Scala and Spark
- Excellent understanding of Hadoop
- Jenkins, HQL (Hive Queries), Oozie, Shell scripting, GIT, Splunk
As a Big Data Engineer, you will:
- Work on an external project and develop an application tha ...
Is there a way to access hadoop via eclipse
by /u/Darktrader21
2w ago
As the title suggests, I am new to Hadoop and my instructor gave me a task to access it via Eclipse, through what is called the Java API. I've searched through many videos, but most of them cover word-count problems and don't solve my problem. Any suggestions?
Cirata for Hadoop Migration
by /u/whistlerbumps
1M ago
My company is exploring Cirata for a 5 PB data migration to Azure. The technology (centered on the Paxos algorithm) seems very impressive for large, unstructured datasets, but I'm not sure. Does anyone have any experience using them and any thoughts they would be willing to share? Thanks in advance.
Onprem HDFS alternatives for 10s of petabytes?
by /u/rpg36
2M ago
So I see lots of people dumping on Hadoop in general in this sub, but I feel a lot of the criticism is really directed at YARN. I am wondering if that is also true for HDFS. Are there any on-prem storage alternatives that can scale to, say, 50 PB or more? Is there anything else that has equal or better performance and lower disk usage with equal or better resiliency, especially factoring in HDFS erasure coding with roughly 1.5x size on disk? Just curious what others are doing for storing large amounts of semi-structured data in 2024. Specifically, I'm dealing with a wide variety of data ranging from ...
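For readers weighing the "roughly 1.5x size on disk" figure in the post above, here is a rough back-of-the-envelope sketch in Python. It assumes the poster means a Reed-Solomon policy with 6 data and 3 parity units (HDFS ships one as RS-6-3) and compares it with the default 3x replication. The 50 PB figure comes from the post; the function names are made up for illustration, and the numbers ignore metadata, headroom, and small-file effects.

```python
def ec_overhead(data_units: int, parity_units: int) -> float:
    """Storage overhead factor of an erasure-coding layout: raw bytes per logical byte."""
    return (data_units + parity_units) / data_units


def raw_capacity_pb(logical_pb: float, overhead: float) -> float:
    """Raw disk (in PB) needed to hold logical_pb of data at a given overhead factor."""
    return logical_pb * overhead


LOGICAL_PB = 50.0  # dataset size mentioned in the post

# Assumption: default HDFS replication factor of 3.
replication = raw_capacity_pb(LOGICAL_PB, 3.0)

# Assumption: RS(6,3) is the erasure-coding policy behind the "1.5x" figure.
erasure = raw_capacity_pb(LOGICAL_PB, ec_overhead(6, 3))

print(f"3x replication: {replication:.0f} PB raw")
print(f"RS(6,3) erasure coding: {erasure:.0f} PB raw")
print(f"Raw capacity saved: {replication - erasure:.0f} PB")
```

Under these assumptions, any alternative would have to match a 1.5x overhead while tolerating the loss of any three units in a stripe to be comparable on both disk usage and resiliency.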
HIVE HELP NEEDED !!!
by /u/TopGrandGearTour
2M ago
Hi guys, it's my first time using Hive and I just set it up following a Udemy course guideline. I got an error saying that schemaTool failed due to a Hive exception:
Error: Syntax error: Encountered "statement_timeout" at line 1, column 5. (state=42X01,code=30000)
org.apache.hadoop.hive.metastore.HiveMetaException: Schema initialization FAILED! Metastore state would be inconsistent !!
Underlying cause: java.io.IOException : Schema script failed, errorcode 2
Use --verbose for detailed stacktrace.
*** schemaTool failed ***
Can someone help me with this? I followed these Stack Overflow to trouble sho ...
Big Companies: Java Hadoop or Hadoop streaming
by /u/Hazem_Ahmed22
2M ago
Hello all, I was wondering, from your experience in the industry: at big companies (in terms of market leadership, not only size), is the Java approach to writing MapReduce jobs more popular, or the Hadoop Streaming approach? It would be very interesting to know whether I need to brush up my Java skills or can stick with the Python streaming approach in order to promote myself as a capable Hadoop MapReduce practitioner.
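For context on what the "Python streaming approach" in the post above looks like, here is a minimal sketch of the contract Hadoop Streaming expects: the mapper and reducer exchange tab-separated key/value text lines, and the reducer receives its input sorted by key. The word-count task and the function names are illustrative assumptions, not anything from the post, and the shuffle step is simulated locally with `sorted`.

```python
from itertools import groupby


def map_lines(lines):
    """Mapper: emit one 'word<TAB>1' line per word in the input."""
    for line in lines:
        for word in line.split():
            yield f"{word.lower()}\t1"


def reduce_lines(sorted_lines):
    """Reducer: sum counts per word; input must already be sorted by key."""
    pairs = (line.rstrip("\n").split("\t", 1) for line in sorted_lines)
    for word, group in groupby(pairs, key=lambda kv: kv[0]):
        yield f"{word}\t{sum(int(count) for _, count in group)}"


# Local simulation of map -> shuffle/sort -> reduce on hypothetical input.
sample = ["Hadoop streaming", "hadoop java"]
shuffled = sorted(map_lines(sample))
for result in reduce_lines(shuffled):
    print(result)
```

In an actual streaming job, each function would live in its own script that reads from `sys.stdin` and prints to standard output, and the scripts would be passed to the streaming jar through its `-mapper` and `-reducer` options. The Java API expresses the same two phases as `Mapper` and `Reducer` classes instead of text pipes.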