Categories
Misc

Conjugating verbs using NN?

Hey,

I wanted to know if it is possible to conjugate verbs, assuming a regular pattern, using some form of NN. Essentially, I want to input a verb, in Arabic, and the output is the root form of the verb; e.g. “running” → “run”, or

“كَتَبَ” ← “يَكْتُب”.

I think I have a few challenges in this:

  1. Identify the current form of the verb, can be done using labels, I think…
  2. Transform that form into a different form, e.g. from present to past.

The second part is the one I am not sure about. I couldn’t find any information about relating text or words together under different labels. All I found was sentiment labels, and captioning images, which I don’t think necessarily solves my issue.

Any resources about something like this? Or anyone has any insight?

I am fairly new to TensorFlow, I don’t know where to necessarily look for answers, so any advice on where I should begin any research for future ideas would be greatly appreciated!

submitted by /u/Muscle_Man1993
[visit reddit] [comments]

Categories
Misc

Question about information loss when reducing image size.

submitted by /u/Senior1292
[visit reddit] [comments]

Categories
Misc

Battery charge prediction

Hey guys, ML beginner here.

I’m looking into creating an ML model that predicts how long it will take to fully charge a battery (similar to what Android smartphones has on their lock screen when charging).

I basically want to give the model the current power (in Watts) used to charge, current battery capacity (in mAh), full battery capacity (in mAh) and get the remaining time as the output.

How would I go about that? What kind of data / how much of it would I have to get?

submitted by /u/LGariv
[visit reddit] [comments]

Categories
Misc

What is the fastest way to learn Tensorflow?

I don’t expect to become an expert. Not even close.

  1. Can you learn TF in 5-7 days to create an image classification or computer vision app?
  2. Is going through tutorials and guide on https://www.tensorflow.org/ a good idea to accomplish that?
  3. What is the fastest way to learn TF?

submitted by /u/margot309
[visit reddit] [comments]

Categories
Misc

RAPIDS Accelerator for Apache Spark Release v21.10

This post details the latest functionality of RAPIDS Accelerator for Apache Spark.

RAPIDS Accelerator for Apache Spark v21.10 is now available! As an open source project, we value our community, their voice, and requests. This release constitutes community requests for operations that are ideally suited for GPU acceleration. 

Important callouts for this release:

  • Speed up – performance improvements and cost savings.
  • New Functionality – new I/O and nested datatype Qualification and Profiling tool features. 
  • Community Updates – updates to the spark-examples repository.

Speed up

RAPIDS Accelerator for Apache Spark is growing at a great pace in both functionality and performance. Standard industry benchmarks are a great way to measure performance over a period of time but another barometer to measure performance is to measure performance of common operators that are used in the data preprocessing stage or in data analytics.

We used four such queries shown in the chart below:

  • Count Distinct: a function used to estimate the number of unique page views or unique customers visiting an e-commerce site.
  • Window: a critical operator necessary for preprocessing components in analyzing timestamped event data in marketing or financial industry.
  • Intersect: an operator used to remove duplicates in a dataframes.
  • Cross-join: A common use for a cross join is to obtain all combinations of items.

These queries were run on a Google Cloud Platform (GCP) machine with 2xT4 GPUs each with 104GB RAM. The dataset used was of size 3TB with multiple different data types. More information about the setup and the queries can be found in the spark-rapids-examples repository on GitHub. These four queries show not only performance and cost benefits but also the range of speed-up (27x to 1.5x) varies depending on compute intensity. These queries vary in compute and network utilization similar to a practical use case in data preprocessing.

A bar chart showing GPU vs CPU runtime for four microbenchmarks (Apache Spark Operators) 1. Cross-join 2. Intersect 3.Windowing (with & without data skew) 4.Count Distinct..
  The preceding graph is a little sneak peek into the speed-up one can expect while using Spark-Rapids. A detailed performance analysis will be provided in the next release blog.
Figure 1: Microbenchmark Queries runtime on Google Cloud Platform Dataproc Cluster: GPU vs CPU.

New functionality

Plug-in

Most Apache Spark users are aware that Spark 3.2 was released this October. The v21.10 release has support for Spark 3.2 and CUDA 11.4. In this release, we focused on expanding support for I/O, nested data processing and machine learning functionality. RAPIDS Accelerator for Apache Spark v21.10 released a new plug-in jar to support machine learning in Spark. 

Currently, this jar supports training for the Principal Component Analysis algorithm. The ETL jar extended the input type support for Parquet and ORC. It now also provides users with the functionality to use HashAggregate, Sort, Join SHJ and Join BHJ on nested data. In addition to support for nested datatypes a performance test was also run.

In the figure below, we show that the speed-up is observed for two queries using nested data type input. Some other interesting features that were added in v21.10 are, pos_explode, create_map and so on. Please refer to RAPIDS Accelerator for Apache Spark’s documentation for a detailed list of new features.

A bar chart showing GPU vs CPU runtime for two microbenchmarks (Apache Spark Operators) 1. Count Distinct 2. Windowing.
Figure 2: Microbenchmark Queries runtime for nested datatypes on Google Cloud Platform Dataproc Cluster: GPU vs CPU.

Profiling & qualification tool

In addition to the plug-in, multiple new features were also added to RAPIDS Accelerator for Apache Spark’s Qualification and Profiling tool. The Qualification tool can now report the different nested datatypes and write data formats present. It now also includes support for adding conjunction and disjunction filters, and filter based Regular Expressions and usernames.

The Qualifications tool is not the only one with new tricks: the Profiling tool now provides structured output format and support to scale and run a large number of event logs.

Community updates

We are excited to announce that we are in public preview on Azure and we welcome Azure users to try RAPIDS Accelerated for Apache Spark on Azure Synapse.

We invite you to view our talks presented at NVIDIA’s flagship event, GTC, held from Nov. 8-11, to learn how AI is transforming the world. The RAPIDS Accelerator team presented two talks; Accelerating Apache Spark gives an overview of new functionality and other upcoming features. Also, Discover Common Apache Spark Operations Turbocharged with RAPIDS and NVIDIA GPUs covers many microbenchmarks on Apache Spark.

Coming soon

The upcoming versions will introduce support for 128-bit decimal datatype, inference support for the Principle Component Analysis algorithm and additional nested data type support for multi-level struct and maps. 

In addition, lookout for MIG support for NVIDIA Ampere Architecture based GPUs (A100/A30) which can help improve throughput on running multiple spark jobs with A100. As always, we want to thank all of you for using RAPIDS Accelerator for Apache Spark and we look forward to hearing from you. Reach out to us on GitHub and let us know how we can continue to improve your experience using RAPIDS Accelerator on Apache Spark.

Categories
Misc

Prepare for Genshin Impact, Coming to GeForce NOW in Limited Beta

GeForce NOW is charging into the new year at full force. This GFN Thursday comes with the news that Genshin Impact, the popular open-world action role-playing game, will be coming to the cloud this year, arriving in a limited beta. Plus, this year’s CES announcements were packed with news for GeForce NOW. Battlefield 4: Premium Read article >

The post Prepare for Genshin Impact, Coming to GeForce NOW in Limited Beta appeared first on The Official NVIDIA Blog.

Categories
Misc

Get Started on NVIDIA Triton with an Introductory Course from NVIDIA DLI

Practice machine learning operations and learn how to deploy your own machine learning models on a NVIDIA Triton GPU server.

Deploying a Model for Inference at Production Scale

A lot of love goes into building a machine-learning model. Challenges range from identifying the variables to predict to experimentation finding the best model architecture to sampling the correct training data. But, what good is the model if you can’t access it?

Enter the NVIDIA Triton Inference Server. NVIDIA Triton helps data scientists and system administrators turn the same machines you use to train your models into a web server for model prediction. While a GPU is not required, an NVIDIA Triton Inference Server can take advantage of multiple installed GPUs to quickly process large batches of requests.

To get hands-on practice with a live server, the NVIDIA Deep Learning Institute (DLI) is offering a 4-hour, self-paced course titled Deploying a Model for Inference at Production Scale.

MLOps Overview

NVIDIA Triton was created with Machine Learning Operations, or MLOps, in mind. MLOps is a relatively new field evolved from Developer Operations, or DevOps, to focus on scaling and maintaining machine-learning models in a production environment. NVIDIA Triton is equipped with features such as model versioning for easy rollbacks. It is also compatible with Prometheus to track and manage server metrics such as latency and request count.

Course Information

This course covers an introduction to MLOps coupled with hands-on practice with a live NVIDIA Triton Inference Server. 

Learning objectives include:

  • Deploying neural networks from a variety of frameworks onto a live NVIDIA Triton Server.
  • Measuring GPU usage and other metrics with Prometheus.
  • Sending asynchronous requests to maximize throughput.

Upon completion, developers will be able to deploy their own models on an NVIDIA Triton Server.

Start Learning >>

For additional hands-on training visit the NVIDIA Deep Learning Institute.

Categories
Misc

Teamwork Makes AVs Work: NVIDIA and Deloitte Deliver Turnkey Solutions for AV Developers

Autonomous vehicles are born in the data center, which is why NVIDIA and Deloitte are delivering a strong foundation for developers to deploy robust self-driving technology. At CES this week, the companies detailed their collaboration, which is aimed at easing the biggest pain points in AV development. Deloitte, a leading global consulting firm, is pairing Read article >

The post Teamwork Makes AVs Work: NVIDIA and Deloitte Deliver Turnkey Solutions for AV Developers appeared first on The Official NVIDIA Blog.

Categories
Misc

John Snow Labs Spark-NLP 3.4.0: New OpenAI GPT-2, new ALBERT, XLNet, RoBERTa, XLM-RoBERTa, and Longformer for Sequence Classification, support for Spark 3.2, new distributed Word2Vec, extend support to more Databricks & EMR runtimes, new state-of-the-art transformer models, bug fixes, and lots more!

John Snow Labs Spark-NLP 3.4.0: New OpenAI GPT-2, new ALBERT, XLNet, RoBERTa, XLM-RoBERTa, and Longformer for Sequence Classification, support for Spark 3.2, new distributed Word2Vec, extend support to more Databricks & EMR runtimes, new state-of-the-art transformer models, bug fixes, and lots more! submitted by /u/dark-night-rises
[visit reddit] [comments]
Categories
Misc

Evaluating in TensorFlow Object Detection API – AttributeError

Hello everyone,

I have successfully trained my model using the TensorFlow Object Detection API and wanted to evaluate it on it. I used the following site as a guide: https://tensorflow-object-detection-api-tutorial.readthedocs.io/en/latest/training.html

Link for the code “model_main_tf2.py”: https://github.com/tensorflow/models/blob/master/research/object_detection/model_main_tf2.py

After running the script “model_main_tf2.py”, I received the following error message:

-> INFO:tensorflow:Waiting for new checkpoint at models/my_ssd_resnet50_v1_fpn -> I1220 17:06:56.024288 140351537808192 checkpoint_utils.py:140] Waiting for new checkpoint at models/my_ssd_resnet50_v1_fpn -> INFO:tensorflow:Found new checkpoint at models/my_ssd_resnet50_v1_fpn/ckpt-2 -> I1220 17:06:56.024974 140351537808192 checkpoint_utils.py:149] Found new checkpoint at models/my_ssd_resnet50_v1_fpn/ckpt-2 -> 2021-12-20 17:06:56.098253: I tensorflow/compiler/mlir/mlir_graph_optimization_pass.cc:185] None of the MLIR Optimization Passes are enabled (registered 2) -> /home/ameisemuhammed/anaconda3/envs/tensorflow/lib/python3.9/site-packages/keras/backend.py:401: UserWarning: tf.keras.backend.set_learning_phase is deprecated and will be removed after 2020-10-11. To update it, simply pass a True/False value to the `training` argument of the `call` method of your layer or model. -> warnings.warn(‘`tf.keras.backend.set_learning_phase` is deprecated and ‘ -> 2021-12-20 17:07:08.993353: I tensorflow/stream_executor/cuda/cuda_dnn.cc:369] Loaded cuDNN version 8204 -> Traceback (most recent call last): -> File “/home/ameisemuhammed/TensorFlow/workspace/training_demo/model_main_tf2.py”, line 114, in <module> tf.compat.v1.app.run() -> File “/home/ameisemuhammed/anaconda3/envs/tensorflow/lib/python3.9/site-packages/tensorflow/python/platform/app.py”, line 40, in run _run(main=main, argv=argv, flags_parser=_parse_flags_tolerate_undef) -> File “/home/ameisemuhammed/anaconda3/envs/tensorflow/lib/python3.9/site-packages/absl/app.py”, line 303, in run _run_main(main, args) -> File “/home/ameisemuhammed/anaconda3/envs/tensorflow/lib/python3.9/site-packages/absl/app.py”, line 251, in _run_main sys.exit(main(argv)) -> File “/home/ameisemuhammed/TensorFlow/workspace/training_demo/model_main_tf2.py”, line 81, in main model_lib_v2.eval_continuously( -> File “/home/ameisemuhammed/anaconda3/envs/tensorflow/lib/python3.9/site-packages/object_detection/model_lib_v2.py”, line 1141, in eval_continuously optimizer.shadow_copy(detection_model) -> File “/home/ameisemuhammed/anaconda3/envs/tensorflow/lib/python3.9/site-packages/keras/optimizer_v2/optimizer_v2.py”, line 830, in getattribute raise e -> File “/home/ameisemuhammed/anaconda3/envs/tensorflow/lib/python3.9/site-packages/keras/optimizer_v2/optimizer_v2.py”, line 820, in getattribute return super(OptimizerV2, self).getattribute(name) -> AttributeError: ‘SGD’ object has no attribute ‘shadow_copy’“

My versions:

TensorFlow = 2.6.0
TensorFlow GPU = 2.6.0
Ubuntu = 20.04
Python = 3.9.7
GPU = NVIDIA Corporation TU104 [GeForce RTX 2080]
CUDA = 11.5
cuDNN = 8.2.4

Could the problem be from the TensorFlow version or CUDA/cuDNN version?
If more information is needed, please let me know!

submitted by /u/GT_King0895
[visit reddit] [comments]