Hello! My OS is windows 10 64bit and IDE is eclipse python. I try to install pyarrow with both pip and conda. The installation of pyarrow is successful. But my codes throw the errors like below,
This time I execute same python codes on CentOS 8, And they show the errors like below,
Any idea, please. Best regards.
1 2 |
import pyarrow as pa fs = pa.hdfs.connect(host = 'localhost' , port = 9000 ) |
Error:Traceback (most recent call last):
File "C:\eclipse-workspace\PythonFredProj\com\aaa\fred\hdfs3-test.py", line 14, in <module>
fs = pa.hdfs.connect(host='localhost', port=9000)
File "C:\Python-3.8.3-x64\lib\site-packages\pyarrow\hdfs.py", line 208, in connect
fs = HadoopFileSystem(host=host, port=port, user=user,
File "C:\Python-3.8.3-x64\lib\site-packages\pyarrow\hdfs.py", line 38, in __init__
_maybe_set_hadoop_classpath()
File "C:\Python-3.8.3-x64\lib\site-packages\pyarrow\hdfs.py", line 136, in _maybe_set_hadoop_classpath
classpath = _hadoop_classpath_glob(hadoop_bin)
File "C:\Python-3.8.3-x64\lib\site-packages\pyarrow\hdfs.py", line 163, in _hadoop_classpath_glob
return subprocess.check_output(hadoop_classpath_args)
File "C:\Python-3.8.3-x64\lib\subprocess.py", line 411, in check_output
return run(*popenargs, stdout=PIPE, timeout=timeout, check=True,
File "C:\Python-3.8.3-x64\lib\subprocess.py", line 489, in run
with Popen(*popenargs, **kwargs) as process:
File "C:\Python-3.8.3-x64\lib\subprocess.py", line 854, in __init__
self._execute_child(args, executable, preexec_fn, close_fds,
File "C:\Python-3.8.3-x64\lib\subprocess.py", line 1307, in _execute_child
hp, ht, pid, tid = _winapi.CreateProcess(executable, args,
OSError: [WinError 193] %1 is not a valid win32 application
And more!This time I execute same python codes on CentOS 8, And they show the errors like below,
Error:Traceback (most recent call last):
File "/home/jhwang/eclipse-workspace/BigDataPythonTest/com/aaa/etl/hdfs3-test.py", line 7, in <module>
fs = pa.hdfs.connect(host='localhost', port=9000)
File "/usr/python/anaconda3/lib/python3.8/site-packages/pyarrow/hdfs.py", line 208, in connect
fs = HadoopFileSystem(host=host, port=port, user=user,
File "/usr/python/anaconda3/lib/python3.8/site-packages/pyarrow/hdfs.py", line 40, in __init__
self._connect(host, port, user, kerb_ticket, extra_conf)
File "pyarrow/io-hdfs.pxi", line 75, in pyarrow.lib.HadoopFileSystem._connect
File "pyarrow/error.pxi", line 99, in pyarrow.lib.check_status
OSError: Unable to load libhdfs: ./libhdfs.so: cannot open shared object file: No such file or directory
Both errors are caused by the same reason? I have no idea what is wrong about this simple codes. Any idea, please. Best regards.