Author Topic: On error resume next in Python from concurrent temperature sweeps and measuremen (Read 1020 times)

aronake · « **on:** February 02, 2024, 12:54:35 pm »

Hi,

I use an Agilent E5810A to control a number of instruments. Control programming usually done in Python with vxi11.

It is not too uncommon I run one python program to collect measurements and one to control temperature sweeps. Sometimes a couple of python programs to do some different measurements.

Once in a while I get Error vxi11.vxi11.Vxi11Exception: 15: IO timeout [read] and a program stops. I suspect reason for this is that two communications on the GPIB bus collide.

How can this be solved?

A not so elegant solution would be to just to on error resume next. This could mean that i miss a temperature step once in a while or collection of a data point. typically this is not so critical. How is this done in python?

Or is there some on error retry that is not too complicated to program?

Or other better solution to this problem?

voltsandjolts · « **Reply #1 on:** February 02, 2024, 01:13:55 pm »

To ignore that exception, maybe...

Code: [Select]

try:
    something
except Vxi11Exception:
    pass

alm · « **Reply #2 on:** February 02, 2024, 01:28:32 pm »

Indeed, as voltsandjolts shows, by catching the exception and either doing nothing or printing a warning.

A better solution would be to prevent the error from occurring. In the case of timeouts the solution is either to increase the vxi-11 timeout or make the commands take shorter. For increasing the timeout, if you're using the Python VXI-11 module directly (as opposed to through PyVISA or python-ivi), something like:

Code: [Select]

instrument.timeout = 60

Should help.

To make commands faster it can help to switch from waiting until a slow command is finished to starting the command, then periodically checking the status registers if the command is done, and only fetching the results when you know they're ready. Alternatively triggering the command, then sleeping for some time, and then fetching the results can also work. For this you need to study the specific instrument if it supports that. SCPI DMMs do via initiate and fetch? commands.

aronake · « **Reply #3 on:** February 03, 2024, 09:31:09 am »

Quote from: voltsandjolts on February 02, 2024, 01:13:55 pm

To ignore that exception, maybe...

Code: [Select]
try: something except Vxi11Exception: pass

Hi! Thanks! This is the correct answer to my question, but what i forgot to write is that i would want some way to deal with any except Vxi11Exception in a certain way (some initial declaration or something), not having to add code for each case where there is a Vxi11 read or write. Any way to do this, or Python do not allow for such lazy and bad programming?

aronake · « **Reply #4 on:** February 03, 2024, 09:41:01 am »

Quote from: alm on February 02, 2024, 01:28:32 pm

Indeed, as voltsandjolts shows, by catching the exception and either doing nothing or printing a warning.

A better solution would be to prevent the error from occurring. In the case of timeouts the solution is either to increase the vxi-11 timeout or make the commands take shorter. For increasing the timeout, if you're using the Python VXI-11 module directly (as opposed to through PyVISA or python-ivi), something like:
Code: [Select]
instrument.timeout = 60Should help.

To make commands faster it can help to switch from waiting until a slow command is finished to starting the command, then periodically checking the status registers if the command is done, and only fetching the results when you know they're ready. Alternatively triggering the command, then sleeping for some time, and then fetching the results can also work. For this you need to study the specific instrument if it supports that. SCPI DMMs do via initiate and fetch? commands.

Removing the source of the error is a very good advice!

I am using non-blocking reads for anything that takes longer time, like collecting average of 50 reading at 10 NPLC on 3458a by checking read_stb with some suitable recurrence say every 10th second. Other things i do typically are very quick things, like reading TEMP? on the 3458a or setting temperature or reading temperature of power on an ILX lightwave tec controller. Then occasionally on reads this error happens. And the higher frequency I read or write things from two python programs at the same time, the more likely these errors are to occur.

So it seems to me it is not that the GPIB bus is blocked that cause the error, which could be solved with longer time outs (or better use of the GPIB bus) but that the two python programs issue a GPIB transaction at the same time and they collide. I wonder how the E5810 responds and how python VXI11 handle the case of a busy bus and a new transaction initiated from python. I would have thought the E5810 would have responded with bus busy, and the python VXI11 would just have waited until bus available, but it may be that it then just throws an error.

voltsandjolts · « **Reply #5 on:** February 03, 2024, 09:41:37 am »

Quote from: aronake on February 03, 2024, 09:31:09 am

Quote from: voltsandjolts on February 02, 2024, 01:13:55 pm
To ignore that exception, maybe...

Code: [Select]
try: something except Vxi11Exception: pass

Hi! Thanks! This is the correct answer to my question, but what i forgot to write is that i would want some way to deal with any except Vxi11Exception in a certain way (some initial declaration or something), not having to add code for each case where there is a Vxi11 read or write. Any way to do this, or Python do not allow for such lazy and bad programming?

Write your own read and write functions, which do nothing more than have that exception wrapper in them.

alm · « **Reply #6 on:** February 03, 2024, 10:14:09 am »

Quote from: aronake on February 03, 2024, 09:41:01 am

I am using non-blocking reads for anything that takes longer time, like collecting average of 50 reading at 10 NPLC on 3458a by checking read_stb with some suitable recurrence say every 10th second. Other things i do typically are very quick things, like reading TEMP? on the 3458a or setting temperature or reading temperature of power on an ILX lightwave tec controller. Then occasionally on reads this error happens. And the higher frequency I read or write things from two python programs at the same time, the more likely these errors are to occur.

I have been running multiple parallel scripts (obviously not for the same instrument) on VXI-11 a lot.Mostly using HP E2050A but also E5810A. I do increase the timeout especially with the 3458A on the bus, but it's extremely rare that I get an error. The Keithley 2000 might give an error with a garbled response once every week or so (so I run long running scripts in a while true loop in the shell that keeps invoking the script), but I blame this on flaky Keithley GPIB implementation. I haven't seen this with HP or Datron / Wavetek equipment.

Microdoser · « **Reply #7 on:** February 03, 2024, 10:48:07 am »

Personally, I would always log the specific exception with a timestamp before moving on with the rest of the code, instead of just simply passing. Then you can check the log later and look at exactly what happened and when, and try to find ways of stopping it happening again. If nothing else, you have a record of how often it happens.

IMO It's always better to stop errors from happening in the first place if possible, than to blindly pass and carry on. Sure, your program will work, but it's not the ideal solution.

aronake · « **Reply #8 on:** February 04, 2024, 08:00:49 am »

Quote from: alm on February 03, 2024, 10:14:09 am

Quote from: aronake on February 03, 2024, 09:41:01 am
I am using non-blocking reads for anything that takes longer time, like collecting average of 50 reading at 10 NPLC on 3458a by checking read_stb with some suitable recurrence say every 10th second. Other things i do typically are very quick things, like reading TEMP? on the 3458a or setting temperature or reading temperature of power on an ILX lightwave tec controller. Then occasionally on reads this error happens. And the higher frequency I read or write things from two python programs at the same time, the more likely these errors are to occur.
I have been running multiple parallel scripts (obviously not for the same instrument) on VXI-11 a lot.Mostly using HP E2050A but also E5810A. I do increase the timeout especially with the 3458A on the bus, but it's extremely rare that I get an error. The Keithley 2000 might give an error with a garbled response once every week or so (so I run long running scripts in a while true loop in the shell that keeps invoking the script), but I blame this on flaky Keithley GPIB implementation. I haven't seen this with HP or Datron / Wavetek equipment.

Rather than just concluding there has been an error I now keep track of which piece of equipment is involved in the error. Over the last two pretty measurement intensive days, there has been two errors. Both involving the ILX Lightwave LDT-5980 Tec Controller. So may be that error is caused by some bad GPIB implementation in this, and that it does not lock the bus and this cause the errors. I will keep looking if there is some other equipment that cause errors in the comming days. The equipment i run now is 5 X 3458a, one 34790a for switching, the ILX lightwave and a DMM6500 (on lan, so not GPIB)

aronake · « **Reply #9 on:** February 04, 2024, 08:02:13 am »

Quote from: voltsandjolts on February 03, 2024, 09:41:37 am

Quote from: aronake on February 03, 2024, 09:31:09 am
Quote from: voltsandjolts on February 02, 2024, 01:13:55 pm
To ignore that exception, maybe...

Code: [Select]
try: something except Vxi11Exception: pass

Hi! Thanks! This is the correct answer to my question, but what i forgot to write is that i would want some way to deal with any except Vxi11Exception in a certain way (some initial declaration or something), not having to add code for each case where there is a Vxi11 read or write. Any way to do this, or Python do not allow for such lazy and bad programming?

Write your own read and write functions, which do nothing more than have that exception wrapper in them.

This seems to be the way forward. And add retry functionality and include error logging in these functions.

miro123 · « **Reply #10 on:** February 04, 2024, 12:42:54 pm »

You already pinpoint the root cause of the problem. It is concurrence of VXI11 transactions from concurrent running processes.
My solution - I have chosen to use RAW socket instead of any high level protocols.
Advantage - it works is all Ethernet devices so far.includion Prologix ethern dongle.
Disadvantage - Requires basic software knowledge of writing multythreaded application.
Here is an example init function - is is part of bigger class
socket_tx_function and socket_rx_function are two thered that handle the two way socket communication. Building a semaphore in both function avod any concuancy on ether socket level. Time out with error reporting is good practice to avoid deadlocks or network glitches

   def init_eth(self):
      HOST='10.0.0.89'   #prologix dongle . ToDo crete self discovery Prologix function
      PORT=1234
      buf_len=10
      self.s=socket.socket(socket.AF_INET,socket.SOCK_STREAM,socket.IPPROTO_TCP)
      a=self.s.connect((HOST,1234))
      self.eth_rx = threading.Thread(target=self.socket_rcv_function)
      self.eth_tx = threading.Thread(target=self.socket_tx_function)

      time.sleep(0.3) # start prologix specific code
      self.eth_rx.start()
      self.s.sendall(b'++addr 9\r\n')
      time.sleep(1)
      self.s.sendall(b'++auto\r\n')
      time.sleep(1)
      self.s.sendall(b'Read?\r\n')
      time.sleep(1)
      self.s.sendall(b'Read?\r\n')
      time.sleep(1)
      self.eth_tx.start()

alm · « **Reply #11 on:** February 04, 2024, 01:01:35 pm »

Quote from: miro123 on February 04, 2024, 12:42:54 pm

You already pinpoint the root cause of the problem. It is concurrence of VXI11 transactions from concurrent running processes.

This works just fine for me on Agilent E5810A and HP E2050A. Concurrent commands wait until the GPIB bus is free again, unless you reach the timeout. What hardware did you have the problems with?

aronake · « **Reply #12 on:** February 04, 2024, 02:04:31 pm »

Quote from: alm on February 04, 2024, 01:01:35 pm

Quote from: miro123 on February 04, 2024, 12:42:54 pm
You already pinpoint the root cause of the problem. It is concurrence of VXI11 transactions from concurrent running processes.
This works just fine for me on Agilent E5810A and HP E2050A. Concurrent commands wait until the GPIB bus is free again, unless you reach the timeout. What hardware did you have the problems with?

Thanks for confirming. Thats how I thought it was supposed to work out.

I previously just took notice there was an error happening and not what device. But since starting to dig into this, it seems all error are related to an ILX Lightwave tec controller. So my suspicion is that thisone have a not to great GPIB implementation

more here:
https://www.eevblog.com/forum/metrology/on-error-resume-next-in-python-from-concurrent-temperature-sweeps-and-measuremen/msg5314045/#msg5314045


EEVblog Main Site	EEVblog on Youtube	EEVblog on Twitter	EEVblog on Facebook	EEVblog on Odysee

EEVblog Electronics Community Forum

Author Topic: On error resume next in Python from concurrent temperature sweeps and measuremen (Read 1020 times)

aronake

On error resume next in Python from concurrent temperature sweeps and measuremen

voltsandjolts

Re: On error resume next in Python from concurrent temperature sweeps and measuremen

alm

Re: On error resume next in Python from concurrent temperature sweeps and measuremen

aronake

Re: On error resume next in Python from concurrent temperature sweeps and measuremen

aronake

Re: On error resume next in Python from concurrent temperature sweeps and measuremen

voltsandjolts

Re: On error resume next in Python from concurrent temperature sweeps and measuremen

alm

Re: On error resume next in Python from concurrent temperature sweeps and measuremen

Microdoser

Re: On error resume next in Python from concurrent temperature sweeps and measuremen

aronake

Re: On error resume next in Python from concurrent temperature sweeps and measuremen

aronake

Re: On error resume next in Python from concurrent temperature sweeps and measuremen

miro123

Re: On error resume next in Python from concurrent temperature sweeps and measuremen

alm

Re: On error resume next in Python from concurrent temperature sweeps and measuremen

aronake

Re: On error resume next in Python from concurrent temperature sweeps and measuremen

Share me