01-21-2020, 07:08 PM
(09-27-2019, 12:20 PM)hugepanic Wrote: I would like to get a PinebookPro for data science tasks under linux with python and jupyter notebook.
Are there any troubles I have to expect with this setup?
Does it matter that the PBP uses a ARM cpu?
Any libraries that will create trouble?
Or can I just "PIP install" all the stuff I need without much trouble?
Does the Linux distribution matter?
Thx
I'm running Manjaro's image on my eMMC with Jupyter, Pandas, Sympy, Plotly, Numpy, Scikit-learn, plotly, matplotlib, statsmodels, tqdm, boto3, pyserial, pytz, flask (testing APIs for data transformations), and other libraries installed on Python 3.8. My only trouble right now is pandoc doesn't install so I can't export my notebooks to PDF for non-jupyter people.
For quickly working with small datasets, it's fine, but I use Google Collab for higher performance computing (it's free, has GPUs/CUDA) from my laptop, and eventually will setup a ssh tunnel to my desktop for bigger jobs.
I work mostly with time series sensor data and lab data from controlled experiments and environmental monitoring applications. Nothing below 1 sample per 10 seconds (10 minute average datapoint interval) on hours to months worth of data -- so I can use the PBP as a daily driver, as it is not much slower than my MacBook Pro for these tasks (5x as long on sub 1 second data transformations isn't terribly noticeable).
Memory is a problem however, as Chromium is the best performing browser, which eats away at your 4GB pretty quick. I run Arglebargle's zram-swap service, which helps.